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PREFACE 

It is widely believed that statistics in general and design of 
experiments in particular is a difficult, dull, highly specialized subject 
to the studied and handted by professionals who are mathematically 
oriented. This is a big misconception which has grown and developed 
over time just by ‘ripple effect’. Actually, the subject is intersting, 
stimulating, revealing and above all near to reality as uncertainties 
are also taken care of while building models and drawing inferences. 
The need of the hour in research is to remove this misconception by 
putting forward practical examples, their solution in a simple way and 
eschewing mathematical jargon as far as possible. 

This book is an attempt to correct their misconception. So that 
design of experiments can be introduced to be used extensively 
among a larger audience. Such audience includes students of 
agriculture, biology, statistics, research methodology, social 
sciences, forestry, medical sciences, environmental sciences, animal 
sciences, veterinary sciences, business management and 
engineering sciences to larger extent. In order to achieve this 
objective the authors have adopted an expositional style with simple 
concepts, tools and use with many examples from agriculture and 
biological sciences but the concepts and treatment remains almost 
same while dealing with problems from other sciences in the 
application of various designs discussed in this book. 

The special feature of this book is its simplicity and the 
systematic style of presentation of the subject matter. The 
assumption and conditions on which various analysis and planning of 
the designs are based have been explained by means of simple, 
stepwise and straight strategy. The examples and theory have been 
taken from various books, manuals and reports for the benefit of 
students and teachers to comprehend and appreciate the actual 
problem faced in planning of experiments and analysis for inference. 
All such references have been listed at the end of book. Authours 
wish to acknowledge their input where ever used. Some hypothetical 
examples have also been considered to make the concept clear and 



practically oriented. We have also tried that the implication 

results emerging from analysis be discussed with simniil° the 

ifjMcity and 

thoroughness. 

The main reason for writing this book is to reach the wider 
audience which has been kept out as a result of the complexity of 
treatment of this subjected by many authors. Also, the other objective 
is to bring together many concepts and designs scattered in literature 
for the benefit of readers. Readers from even diverse fields like 
economics, engineering, industries, medical etc. can find the book 
handy and helpful for problem encountered by them during their 
research and decision making. The long experience of more than 
thirty years of authors while offering courses on design and analysis 
of experiments to post-graduate students and applied workers, 
teachers and scientists in the university have prompted to write a 
book on planning and analysis of design of experiments. The book 
contains enough material for a course extending to a year, as 
different kinds, both elementary and complex design have been 
discussed avoiding mathematical symbols while example have been 
solved and presented in steps so as to understand the intricacies of 
analysis. 

Many statistical packages are rapidly becoming available in the 
market as a routine resource in different institutes, colleges, 
universities and research centres in which a number of designs along 
with analysis are given which are being used sometimes unmindfully 
without consulting any competent statistician thus resulting in wrong 
inferences. Computers are unintelligent servants driven by programs. 
We can use any design on the same data and results will follow. It is 
the duty of experimenter/researchers to look into conditions and 
assumption and use the right design for right type of data. Hence 
statistical packages are to be used carefully with open eyes and 
mind. This book can be helpful in looking into various kinds of 
designs which may fit the required experiment and help the 
investigator for interpreting the results. 

This book covers a wide range of topics arranged in nineteen 
° apters ranging from basic principles, analysis of variance, 


transformation of data, single factor experiments, Latin square, 
factorial experiments, confounding, split and strip plot designs, 
covariance analysis, multi-locational experiments, BIBD, Laltice 
designs, augmented designs, block design analysis of diallel crosses, 
biological assays and simple way of finding expected value of mean 
square. Every care has been taken to put even the complex and 
intricate concepts in simple language for the under standing of reader 
may be student researcher or teacher in agricultural, engineering and 
biological sciences particularly. 

Thanks are due to many authors of reports, investigations and 
books for using the examples /data for illustrating the concept and 
analysis of various designs for the convence and help to various 
readers, investigators and researchers. We are also thankful to the 
Director AAREM, CCS HAU and Dean, COBS&H, CCSHAU, for their 
help, guidance and cooperation in bringing out this book. 

The author shall feel obliged for the suggestion from the readers 
in the improvement, modification and enhancement of contents, 
presentation and exposition of the book. 


Hisar 

18 th October, 2008 Deepak G rover 

Lajpat Rai 
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About the Book 

The book is an attempt to put together different designs and 
concepts for the benefit of students, teachers, researchers and 
investigators in a simple and non-mathematical way. Basic concepts 
of planning any design along with its analysis has been presented in 
steps so that even the novice may be able to understand the 
intericasies and get the analysis done in a proper way. Book contains 
nineteen chapters encompassing basic principles, analysis of 
variance, transformation of data, single factor experiments, Latin 
squares, Youden square designs, missing plot techniques, factorial 
experiments and confounding, spell and strip plot designs, 
covariance analysis, BIBD, Lattice designs, Augmented designs, 
block designs, sampling in field experiment, multl-locational 
experiments, analysis of diallel crosses, design and analysis of bio- 
assys have also been discussed at length along with examples. 
Finding expected value of mean square simple method has also 
been given. Optimality criterion for a design is also considered. The 
book shall the helpful for researchers and investigators in agriculture 
sciences, veterionary sciences, animal sciences, biological sciences, 
social sciences, medical sciences, engineering and environmental 
sciences also. 
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Chapter -1 


Basic Principles of Design of Experiments 


Scientists in the biological fields who are involved in research 
constantly face problems associated with planning, designing and 
conducting experiments. Basic familiarity and understanding of 
statistical methods that deal with such issues of concern would be 
helpful in many ways. Researchers who collect data and then look for 
a statistical technique that would provide valid results will find that 
there may not be solutions to the problem and that the problem could 
have been avoided first by a properly designed experiment. 
Obviously, it is important to keep in mind that we cannot draw valid 
conclusions from poorly planned experiment. Second, the time and 
cost involved in many experiments are enormous, and a poorly 
designed experiment increases such costs, time and resources. The 
following examples point out the necessity for a good design that will 
yield good results in research. First, a nutrition specialist in 
developing country is interested in determining whether mother’s milk 
is better than powdered milk for children under age one. The 
nutritionist has compared the growth of children in village A, who are 
all on mother’s milk against the children in village B, who use 
powdered milk. Obviously, such a comparison ignores the health of 
the mothers, the sanitary conditions of the villages, and other factors 
that may have contributed to the difference observed without any 
connection to the advantages of mother’s milk or the powdered milk 
on the children. A proper design would require that both mother’s 
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milk and the powdered milk be alternatively used in both villages, or 
some other methodology to make certain that the differences 
observed are attributable to the type of milk consumed and not to 
some uncontrollable factor. Second, a crop scientist who is 
comparing two varieties of maize, for instance would not assign one 
variety to a location where such factors as sun, shade, unidirectional 
fertility gradient, and uneven distribution of water would either favour 
or handicap it over the other. If such a design were to be adopted, 
the researcher would have difficulty in determining whether the 
apparent difference in yield was due to varietal differences or 
resulted from such factors as sun, shade, soil fertility of the field, or 
the distribution of water. The two examples illustrate the type of 
poorly designed experiments that are to be avoided. 

• Good experimental designs are product of the technical 
knowledge of one’s field, an understanding of statistical 
techniques and skill in designing experiments. Any research 
endeavor may entail the following phases: 

• Concept 

• Design 

• Data collection 

• Analysis 

• Dissemination 

Statistical methodologies can be used to conduct better scientific 

experiments if they are incorporated into entire scientific process, i.e., 

from inception of the problem to experimental design, data analysis 
and interpretation. 

When planning experiments, we must keep in mind that large 
uncontrolled variations are common occurrences. Experiments are 
g nerally undertaken by researchers to compare effects of several 
of rt’ 08 ° n SOme P^ en °mena or in discovering an unknown effect 
ohen CU ^ r ^ rocess ' experiment facilities the study of such 
controi^H* 18 Under contr °lled conditions. Therefore, creation of 
onditions is the most essential characteristics of the 
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experimentation. How we formulate our questions and hypotheses 
are critical to the experimental procedure that will follow. For 
example, a crop scientist who plants the same variety of a crop in a 
field may find variations in yield that are due to periodic variations 
across a field or some other factors that the experimenter has no 
control over. The methodologies used in designing experiments will 
separate with confidence and accuracy a varietal difference of crops 
from the uncontrolled variations. 

The above discussion brings some salient features of 
experimentation: 

(i) All the extraneous variation in the data should be eliminated or 
controlled excepting the variation due to the treatments under 
study. One should not artificially provide circumstances for one 
treatment to show better results than others. 

(ii) For a given size of the experiment, though the experiment can be 
done in many ways, even the best results may not turn out to be 
significant with some designs, while some other designs can 
detect the treatment differences. Thus there is an imperative 
need to choose the right type of design, before the 
commencement of the experiment. 

(iii) If for some specific reason related to the nature of experiment, a 
particular method has to be used in experimentation, then 
adequate number of replications for each treatment have to be 
provided in order to get valid inferences. 

(iv) The treatment have to be randomly allocated to the experimental 
units. 

The Design of an experiment is a complete sequence of steps 
taken before the experiment to ensure that appropriate data will be 
obtained, data which permit an objective analysis leading to valid 
inferences with respect to problem at hand. The individual who 
formulates the design should clearly understand the objective of the 
investigation. 
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Basic Principles of Design of Experiments 


Given a set of treatment which can provide information regarding 
the objective of an experiments, a design for the experiment defines 
the size and the number of experimental units, the manner in which 
treatments are allotted to the units and also appropriate type and 
grouping of the experimental units. These requirements of a design 
ensure validity, interpretability and accuracy of the results obtained 
from an analysis of the observations. 


These purposes are well served by the following three principles 
given by Prof. Ronald A. Fisher and are Known as the Basic 
Principles of Design of Experiment. The three principles are: 


(i) Replication 


(ii) Randomization 


(iii) Local control 


Replication 


If a treatment is allotted to r experimental units in an experiment, 
it is said to be replicated r times. If in a design each of the treatments 
is replicated r times, the design is said to have r replications. 
Replication is necessary to: 

• Increase the accuracy of estimates of the treatment effects. 

• Provide an estimate of the error variance which is a function of 
the differences among observations from experimental units 
under identical treatments. 

Though, the more the number of replications the better it is, so 
far as precision of estimates is concerned, it cannot be increased 
infinitely as it increases the cost of experimentation. Moreover, due to 
limited availability of experimental resources too many replications 
cannot be taken. 


The number of replication is therefore decided keeping in view 


the permissible expenditure and the required degree of precision. 
Sensitivity of statistical methods for drawing inference also depends 
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on the number of replications. Sometimes this criterion is used to 
decide the number of replications in specific experiments. 

Determination of number of replications 

Error variance provides a measure of precision of an experiment, 
the less the error variance the more precision. Once a measure of 
error variance is available for a set of experimental units, the number 
of replications needed for a desired level of sensitivity can be 
obtained as below. 

Given a set of treatments an experimenter may not be interested 
to know if two treatment differ in their effects by less than a certain 
quantity, say, d. in other words, he wants an experiment which should 
be able to differentiate two treatments when they differ by d or more. 

We know that the significance of the difference between two 
treatments is tested by t-test where 

t _ . Vi ~ Y, 

V2s 2 /r 

Here y ,, and yj are the arithmetic means of two treatment 
effects each based on r replications, s 2 is measure of error variation. 

Given a difference d, between two treatment effects such that 
any difference greater than d should be brought out as significant by 
using a design with r replications, the following equation provides a 
solution of r. 

Idl 

t o= 1 = ■ 

V2s 2 /r 

t 2 

=> r = -±-x2s 2 ... (i.i) 

where t 0 is the critical value of the t-distribution at the desired 
level of significance, that is, the value of t at 5 or 1 percent level of 
significance read from the t-table. If s 2 is known or based on a very 
large number of observations, made available from some plot pre- 




degree of freedom. 

When the number of replication is r or more as obtained above 
then all differences greater than d are expected to be brought out 
significant by an experiment when it is conducted on as set of 
experimental units which has variability of the order of s 2 . 


For example, an experiment on wheat crop was conducted in a 
seed farm in Bhopal to study the effect of application of nitrogen and 
phosphorus on yield in a randomized block design with three 
replications. There were 11 treatments two of which were (i) 60kg/ha 
of nitrogen (ii) 120kg/ha of nitrogen. The average yield figures for 
these two application of the fertilizer were 1438 and 1592 kg/ha 
respectively and it is required that differences of the order of 150 
kg/ha should be brought out significant. The error mean square (s 2 ) 
was 12134.88. Assuming that the experimental error will be of the 
same order in future experiments and t 0 is of the order of 2.00, which 
is likely as the error d.f. is likely to be more than 30 as there are 11 
treatments, Substituting in (1.1); we get: 


r = 


2tpS 2 2x2 2 xl 2134.88 


150' 


= 4(approx). 


Thus, an experiment with 4 replications is likely to bring out 
differences of the order of 150kg/ha as significant. 

Another criterion for determining r is to take a number of 
replications which ensures at least 10 d.f. for the estimate of error 
variance in the analysis of variance of the design concerned since 
the sensitivity of the experiment will be very much low as the F test 
(which is used to draw inference in such experiment) is very much 
unstable below 10 d.f. 

Randomization 

As discussed earlier, by replication the experimenter tries to 




average out as far as possible the effects due to uncontrolled factors. 
This brings to him the question of allocation of treatments to 
experimental units so that each treatment gets an equal chance of 
showing its worth. In the absence of the prior knowledge of the 
variability of the experimental material, this objectives is achieved 
through “randomization”, a process of assigning the treatments to 
various experimental units in a purely chance manner. The following 
are the main objectives of randomization. 


(i) The validity of the statistical tests of significance, e.g., the t -test 
for testing the significance of the difference of two means or the 
‘Analysis of Variance’, F-test for testing the homogeneity of 
several means, depends on the fact that the statistic under 
consideration obeys some statistical distribution. Randomization 
provides a logical basis for that and makes it possible to draw 
rigorous inductive inferences by the use of statistical theories 
based on probability theory. This assumption of randomness is 
necessary since S.E.^ = cr/Vnfor random sampling only. 

Randomising the treatments over the experimental units is an 
essential safeguard against distortion of experimental results by 
un-anticipated influences such as rise in ambient temperature, 
drift in calibration of instruments and equipment, fertility of the 
soil or other systematic changes. 

(ii) The purpose of randomness is to assure that the sources of 
variation, not controlled in the experiment, operate randomly so 
that the average effect of any group of units is zero. In other 
words, randomization ensures that different treatments, by the 
repetition of the experiment, on the average are subject to equal 
environmental effect. Randomization eliminates bias in any form. 
It equalises even factors of variation over which we have no 
control. 


It should be noted that randomization without replication is not 
sufficient. It is only when randomization of treatments to various units 
is accompanied by an adequate number of replications then we are 
in a position to apply the tests of significance (t-test or F-test) 
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Local Control 

If the experimental material, say field for agriculture 
experimentation, is heterogeneous and different treatments are 
allocated to various units (plots) at random over the entire field, the 
soil heterogeneity will also enter the uncontrolled factors and thus 
increase the experimental error. It is desirable to reduce the 
experimental error as far as practicable without unduly increasing the 
number of replications or without interfering with the statistical 
requirement of randomness, so that even smaller differences 
between treatments can be detected as significant. In addition to the 
principles of replication and randomization discussed earlier, the 
experimental error can further be reduced by making use of the fact 
that neighbouring areas in a field are relatively more homogeneous 
that those widely spread. In order to separate the soil fertility effects 
from the experimental error, the whole experimental area (field) is 
divided into homogeneous groups (blocks) row-wise or column-wise 
(one-way elimination of fertility gradient, c.f. Randomized Block 
Design) or both (elimination of fertility gradient in two perpendicular 
direction c.f. Latin Square Design, according the fertility gradient of 
the soil such that variation in each block is minimum and between the 
block is maximum. The treatments are then allotted at random within 
each block. Then the process of reducing the experimental error by 
dividing the relatively heterogeneous experimental area in to 
homogeneous blocks is known as local control. Various forms of 
arranging the units (plots) in to homogeneous groups (blocks) have 
so far been evolved and are known as experimental designs, e.g. 
Randomized Block Design, Latin Square Design exc. 

Planning Consideration 

In adding to replication, randomization and local control a 
other points should be given consideration in planning f ,e 
experiments. Some of the important steps are as under:, 

1 • Objectives of the experiment 

2. Scope of the experiment 

3. Formulation of the Hypothesis 
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4. Selection of treatments 

5. Selection of site and experimental material 

6. Selection of plot size, shape and blocks 

7. Determination of number of replications. 

8. Choice of an experimental design. 

9. Transformations 

10. Statistical analysis 

11. Drawing of conclusions. 

Objective of the experiment 

The preparation of a written statement of the objectives, listed in 
order of their priority will often result in an more concise and realistic 
experiment. 

Scope of the experiment: The proper application of the scope 
of the experiment presumes a knowledge of the agricultural 
background of the problem and it is desirable that agricultural 
aspects of the experimental design should receive full consideration 
alongwith the statistical aspects before it is finally settled. 

Formulation of the hypothesis 

Keeping in view the objectives of the experiment, the researcher 
has to formulate certain hypothesis alongwith level of confidence 
which after the application of certain test of significance is to be 
accepted or rejected. 

Selection of Treatments 

The treatments to be used in an experiment are determined by 
the objectives. They are selected to answer the questions raised by 
the problem and the specific objectives. If the objective of the 
experiment is to evaluate new selections in a breeding program, the 
treatments are then new selections along with one or more standard 
varieties against which the new selections can be compared. 

Site and experimental materia! 

The choice of the experimental site affects the conduct of the 
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experiment and the interpretation of the results in several ways. It 
determines the population to which the results of the experiment will 
apply. Thus the population to which the information is wanted must 
be defined carefully and the site must be one that represents the 
population. The selected material must be homogenous and 
representative of the population for which results are to be made 
applicable. 

Selection of plot size and shape and blocks 

A uniformity trail is conducted to know the nature and magnitude 
of soil heterogeneity and to determine the optimum size and shape of 
plots and blocks. At the time of harvest the whole field is divided in to 
smaller units and the yield of these units are individually recorded. 
Any variation in yield is attributed to soil heterogeneity. On the basis 
of individual plot yield, a contour map can be prepared by joining the 
lines of equal fertility. There are several methods of determining the 
optimum plot size, some of them are (i) maximum curvature method 

(ii) Heterogeneity index method Smith (1938) etc. 

Number of Replications 

The number of replications serves the double purpose i.e. it 
provides an estimate of the error as well as it is the biggest orce o 
reducing the experimental error. The deciding factors for determining 
the number of replications are: 

(i) Degree of precision required. 

(ii) Amount of variability present in the material. 

(iii) Available resources. 

(iv) Size and shape of experimental units. 

There are many methods available for determining the optimum 

number of replications, however as layman the number of 

replications should be such that it provides atleast 10-12 degree of 
freedom for error. 

Choice of an experimental design 

are a number of factors that affects the choice of 
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experimental design, however a cardinal rule "choose the simplest 
experimental design that will give the required precision within limits 
of the available resources. 

There are three main steps in choosing the design: 

(i) It has to be decided whether the design is unifactor or factorial. 

(ii) It has to be decided whether to group the observation to 
eliminate one, two or more levels of heterogeneity (causes of 
variations). For instance, it may be desired to eliminate 
simultaneously the effect of time and day of taking the 
observations. A Latin square design e.g.: 



Day 1 

Time 

1 

2 

3 

4 

1 

A 

B 

C 

D 

2 

B 

C 

D 

A 

3 

T c 

D 

A 

B 

4 

D 

A 

B 

c 

Illustrates 

one method 

of doing this, 

since each 

-- 

treatment is 


observed once each day and once at each time of the day. 

(ifi) If the no. of treatment combinations is too large to be adjusted in ' 
a single block, the design will be referred to as an “incomplete 
block design” otherwise as “complete block design”. 

Following table indicates the types of design that might be used 
for unifactor or factorial experiments in complete or incomplete 
blocks eliminating one or two ways of heterogeneity. 

Table 1: Classification of main designs 


Complete 


Unifactor 

Factorial 

block 

One grouping 

Two grouping 

Completely randomized 
Randomized block 

Latin square 


One grouping 

Balanced 

Incomplete block 

Confounded design 

Incomplete 


Partially balanced 

Fractional 

block 

Two grouping 

and cyclic designs 

replication, split 
plot design 


Youden squares 

Quasi-Latin 



Lattice squares 

squares 




20 


Experimental Designing and Data Analysis 


Transformation 

Sometimes a situation arises when a straight forward analysis of 
variance of data is not valid. This happens when the observation 
refer to small wrole numbers. This difficulty can be overcome by 
transforming the data on a suitable scale and analyzing the 
transformed values. 

Statistical analysis 

When the experiment has been completed, the data should be 
subjected to statistical analysis. The first step is to prepare an 
ANOVA Table. Once the ANOVA has been completed, the data are 
subjected to additional analysis to proved specific information on the 
questions raised by the objectives of the experiment. 

Drawing of conclusion: The results are to be interpreted and 
put up in the form of a report. 

Terminology 

Experiment: An experiment is a device or a means of getting an 
answer to the problem under consideration. Experiment may be 
absolute or comparative. 

Absolute experiments: Here we determine the absolute value 
of some characteristic like (i) obtaining the average Intelligence 
Quotient (IQ) of a group of people (ii) finding the correlation 
coefficient between two variables in a biveriate distribution etc. 

Comparative experiments: Here the effect of two or more 
objects on some population characteristics, e.g comparison of 
different manures and fertilizers, different kinds of varieties of crop, 
different cultivation processes, different pieces of land in a field 
experiment, or different diets or medicines in a dietary or medical 
experiment respectively are studied. 

Treatment 

atment refers to controllable quantitative or qualitative factor 
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imposed at a certain level by the experiment. For an Agronomist 
several fertilizers, concentrations applied to a particular crop or a 
variety of crop is a treatment. Similarly, an Animal Scientist looks 
upon several concentrations of a drug given to animal species as a 
treatment. In agribusiness we may look upon impact of advertising 
strategy on sales of a departmental store. To an agricultural 
engineer, different levels of irrigation may constitute a treatment. For 
a plant breeder different varieties are treatments. 

Experimental unit 

Experimental unit may be looked upon as a small subdivision of 
the experimental material, which receives the treatment. For a 
agronomist or horticulturist it may be plot of a land or batch of seed, 
for an animal scientist it may be a group of pigs or sheep, for a 
scientist engaged in forestry research it may be different tree species 

occurring in an area, and for an agricultural engineer it may be 
manufactured item. 

Experimental errors 

Differences in yields arising out of experimental units treated 
alike are called experimental errors. 

To the statistician an experiment error reflects errors of 
experimentations, errors of observations, errors of measurements 
the variation of experimental material and the combined effects of 
extraneous factors which are not singled out for attention in the 
investigation. 

Experimental error may often be reduced by the use of more 
homogeneous experimental material, careful stratification of available 
material, greater care in conducting the experiment, more efficient 
experimental designs, related information provided by other sources 

Yield: The measurement of a variable under study on different 
experimental units (e.g. plots in field experiments) are termed as 
yields. 

Replication: Replication measures the execution of an 
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experiment more than once. In other words, the repetition of 
treatment under investigation is known as replication. 

Precision: The reciprocal of the variance of the mean is termed 
as the precision or the amount of information of a design. Thus for an 
experiment repeated r times the precision is given by: 

1 r 

—-— = — where a 2 is the error variance per unit 
var(x) a 2 

Efficiency of a design 

Consider two designs and D 2 with error variance per unitaj 
and o\ and replications n and r 2 respectively. Then the variance of 
difference between two treatment mean is 

— L and—— for designs Di and D 2 . 
r 1 r 2 

Then efficiency of design Di w.r.t. D 2 
_ Amount of information in Di _ q r 2 
Amount of information in D 2 a 2 a 2 

If E = 1 then both the designs are said to be equally efficient 

If E >1 (E<1) the Di is said to be more (less) efficient than D 2 

An efficient design has a greater ability of detecting the 
differences of treatment effects, so the efficiency of a design can be 
increased by 

(i) decreasing a 2 , the error variance per unit. This is done by 
arranging plots in to small homogeneous blocks and 

(ii) increasing r, the number of replication 

Factors 

Experimental variable are termed as a factor. For example, a 

fertilizer, a new feed ration, and a fungicide are all considered as 
factors. 
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Basic Principles of Design of Experiments 

Level of a factor 

Level of a factor is the number of ways in which a factor is varied 
in the experiment. When a factor is varied in p ways we refer it as 
having p levels e.g. 2 modes of an application such as drip or 
sprinkle irrigation, is referred to as 2 level factors. Application of 
Nitrogen (N) to crop as N 0 = 0 N 1= 60 kg/ha, N 2 = 120 kg/ha is 
referred to as 3 level factors. 

Factorial Experiment 

An experiment is called factorial experiment if its treatments are 
combination of levels of different factors. 

Uniformity trails 

To conduct a uniformity trail the area is planted uniformly to a 
single crop. Cultural practices are conducted throughout the trial in as 
uniform way as possible. The trial is then partitioned into small units, 
or plots. These are harvested individually and the individual yields, 
together with the location of the plot from which they came, are 
recorded. These can be used to construct a yield map of the site. 
Such a map can be helpful in deciding the layout of the field plan. It 
can also be used in determining the shape and orientation of the 
plots and in grouping the plots into blocks of similar plots. Further 
information from a uniformly trial can be used to determine the 
optimum size of plots as well. 

Agronomy trials 

Agronomy trials include fertilizer studies, time, rate and density of 
planting, tillage study, and pest and weed control studies. Because 
the response to changes in the level of one factor is often conditioned 
by the level of other factors it is almost essential that the treatments 
in agronomy trials include combination of multiple levels of two or 
more production factors. In general the plot size for agronomy trials is 
large than that of variety trials. This is because of the size of 
machinery involved in applying the treatments and because of the 



24_ Experimental Designing and Data Analysis 

border effects introduced, in part, by the use of machinery. 


Variety trials 

Variety trials form the backbone of a plant breeding programme. 
They range from the initial studies involving small plots (single plant 
or single row) to replicated yield trials involving fairly large plots. 

Blocking 

The procedure here is to group the plots or experimental units 
into blocks or groups of homogeneous units. Treatments are then 
assigned at random to the plots within the blocks. 

For example, the field under consideration for an experiment 
might contain two types of soil, we could select plots on each soil 
type and randomly assign treatments to plots within soil types. In this 
case soil type would be a blocking factor and the difference between 
soil types would be removed from the experimental error in the 
analysis of data. The key to successful blocking is to maximize the 
variation among blocks and to minimize the variation among plots 
within blocks. 

Blocks and replications are not the same thing. A block is simply 
a group of plots, whereas a replication is a repetition of a treatment in 
an experiment. 


a □ n 


Chapter - 2 


Analysis of Variance and Tests for 
Comparison between Treatment Means 


The technique used for the analysis and interpretation of data 
collected through designing an experiment is the analysis of variance 
(ANOVA) which essentially consists of partitioning the total variation 
present in the data into meaningful components. There are mainly 
three kinds of variation present in the data so generated due to (i) 
treatments (ii) environmental (iii) error. Any variation which arises due 
to the factors introduced by the experimenter intentionally into the 
experiment will be known as variation due to treatments, while the 
variation in the experimental material is called environmental 
variation. The uncontrolled variation, which arises due to random 
fluctuations in the data will be labelled as random error and 
importantly provides a yard stick for testing the treatment and 
environmental variations. The partitioning of the total variation into 
respective components will be achieved by suitably assuming a 
statistical model for the observational data in terms of certain 
unknown parameters. 

The parameters in the analysis of variance model may either be 
fixed constants or random variables. Depending on the nature of the 
parameters. Eisenhart classified the analysis of variance models into 
three categories as follows: 

An analysis of variance model is a fixed effects model (or 
Eisenhart’s model - I) if all the parameters are constants; random 
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effect model (or Eisenhart’s model - II) if all the parameters 
excepting the general mean are random variables, or mixed effect 
model (or Eisenhart’s model - III) if parameters other than general 
mean are both constants and random variables. In problems where 
the experimenter will be confining his attention just to the treatments- 
applied, fixed effects model will be applied. However, to draw 
inferences beyond the applied treatments, random effects model will 
be used and in such cases the treatments and/or experimental units 
will be randomly chosen from the population of treatments and/or 
experimental units to which inferences are proposed to be made. 

In fixed effect models, the main objectives are to estimate the 
effects, find a measure of variability among the effects of each of the 
factors and finally find the variability among the error effects. In 
random effect models, the main emphasis is on estimating the 
variability among the effects of the different factors. The methodology 
for obtaining expressions of variability is, however, mostly the same 
in the different models, though the methods appropriate for their 
testing are different. 


1. Fixed Effects Model 


One way classification: Let a factor A be experimented with s 

levels on n homogeneous experimental units, i ,h level being used on 

\ 

s 


( 


a units 


n = Inj 
i = 1 ) 


. If yj is the response from the j th experimental 


unit to which the i th level is given, we can assume. 


yij = p+ <Xj +ejj, j = 1, 2., a; i =1,2 .,s 

where p is the general mean, a, is the effect of i th level of factor A 
and ejj‘s are random errors. Assuming that the errors are 
independently and normally distributed with mean zero and constant 
variance a 2 . The analysis of variance table partitioning the total 
variation into two components will be obtained as given in ta£>le 1.1. 

The null hypothesis to be tested in problems of this nature is H 0 : 
a 2 = a 2 =.a s . That is, the effects of s levels are equal against 
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the alternative hypothesis that not all iXj’s are same. If the calculated 
F value in the above Anova table exceeds F s . 1)n . Si u then we reject H 0 
and conclude that the effects of all the levels are not equal at a level 
of significance; otherwise not. 


Table 1.1: Anova table for one-way classification 


Source 

d.f. 

S.S. 

M.S. 

F (calculated) 

Factor A 

s-1 

1 Vi 2 y . 2 

i = 1 n i n 

ms a 

MS A /MS e 

Error 

n-s 

By subtraction 

MS e 


Total 

n-1 

2 y 2 




Multiple Comparison Tests 


When the treatment differences are established from the Anova 
table, one will be interested to isolate the treatments giving 
significantly different response than others. This could be achieved 
by multiple comparison methods and we briefly discuss some of the 
commonly used tests. 


(i) Least significant difference (Isd): Using this test one 
concludes that two treatments are significantly different if the 
difference in their mean response exceeds the Isd value, which is the 
product of the tabulated t-value at the chosen level of significance 
based on error d.f. with the standard error of the difference of the 
treatment means. The Isd value for comparing the ith and jth levels of 
f actor A which a re applied to nj and nj experimental units is (t n . s , 2 )• 


( 


MS, 




A 


n 


n 


i ; 


. The Isd value is also known as critical difference 


(CD). When all the levels are equally replicated, say r times, then Isd 
v alue for co mparing any two different treatment means is (t n _ s , (1j2 ). 

V 2M S e /r . A bar diagram can be drawn which shows all effects 

which do not differ among them significantly under one bar. 

Since the Isd procedure is equivalent to making many t-tests, this 
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method comparing different treatment means is also called multiple t- 
test technique. 

(ii) Scheffe’s Multiple Comparison Test: With this test any two 
treatments will be judged significant at cx level of significance if the 

difference between their means exceeds {(s -1) F s _ x n _ S (X j 2 times 


the standard error of the difference in their means. In other words, 
the ith and jth levels of factor A will have significant different effects at 
cx level of significance if 


Yi-Yj >{(s-1)F s _ 1in _ s , tt } 2 jMS ( 


1 1 

— + — 


A 


v n i 


n 


j ) 


Schelffe’s multiple comparison tests are more conservative than 
multiple t-tests and fever treatment differences will be detected by 
this procedure than Isd test. 

(iii) Dunnett’s Multiple Comparison Test: When each of s-1 
levels of factor A have to be compared with a control level of A, 
Dunnet proposed an alternative method rather than Isd. To perform 
this test, we find t' value from the table. We read the column headed 
by p = s-1 and row corresponding to error d.f. and choose P = .95 or 
.99 according as the level of significance a used is .05 or .01. Then 
the i 1h level of factor A is significantly different from the control level of 
A if the absolute value of the difference in the mean values exceed 
t' a ^/2MS e /r , where r is the equi-replications of all levels of factor 

A. 


(iv) Newman-Keuls Range Test: The treatment means 
Yi/Y2/.Vs Will be ranked from highest to lowest as 

Xg.Xg.i x a . We now use the table. In the table, we enter 

s, error d.f. = n-s and protection level is the level of 
significance and the entry may be denoted by q s , n - S/ «• Now 

we compute the upper a percent point of studentized critical 
ranges 

W s 


(ds>n-s, «) VMSe/r 
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w s-i = (qs-i.n-s.oc) /MSe/r 
Compare (x s - x x ) with W s 

•f ( x s " x i) < W s , we conclude that at a level of significance that 

all treatments are not significantly different and the process of 
comparison stops. However, if (x s - x x ) > W s , we divide 

x s» x s-i .. x i into two subgroups one containing x s ,...., x 2 and 

the other from x s . 1 .. x 1 . We compare each of (x s - x 2 ) and 

( x s-i ' x i) Ws.t. If either range does not exceed Ws.!, then 

the means in each of the groups are equal. If either or both ranges 
exceeds W s .!, then the s-1 means in the group concerned are divided 
into two groups of s-2 means each and the ranges for these 
subgroups are compared with W s . 2 . This procedure is continued until 
a group of i means is found whose range does not exceeds Wj. In 
other words, by this method the difference between any two means is 
significant when the range of the observed means of each and every 
subgroup containing the two means under test is significant 
according to studentized critical range. 

(v) Duncan’s Multiple Range Test: The treatment means are 
ranked highest to lowest. Let these are Y s , Y^.Y!, now 

find significant studentized ranges at 5% or 1% for multiple range test 
for various means and at (n-s) degree of freedom from the tables 
given in the books under the heading “Significant studentized ranges 
for 5% and 1% level Multiple Range Test’. 

Let it be denoted as (d p , n-s, a) 

Compute D s = (d s , n-s, a) VMSE/r 

D s .i = (d s .i, n-s, a) VMSE/r. 

We then compare Y s -Yi with D s . If Y s -Y! < D s , then we 

conclude that all the treatments do not differ significantly at a given 
level of significance otherwise differ significantly. However if Y s - Y, > 

D s . then we compare each of Y s - Y 2 and Y^ - Y, with D^. If either 

range is less than D s _i then the treatments in that set are not 
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significantly different. If either range exceeds D s . 1f then s-1 means 
are divided into two sets of s-2 means each and compared with D 3 _ 2 . 
The process continues till a subset of non significant means are 
obtained. 


As an illustration, let us consider that 4 different feeds were 
each fed to a lot of 5 baby chicks and the gain in weight (in kg) be as 
given below: __ 


Feed 

Gain in weight (in kg) 

Total 

A 

2.75, 2.45, 2.10, 1.05, 2.60 

10.95 

B 

3.05, 5.60, 1.50, 4.45, 3.15 

17.75 

C 

2.10, 4.85, 4.05, 4.75, 4.60 

20.35 

D 

8.45, 6.85, 8.45, 4.25, 7.70 

35.70 



84.75 


Correction Factor 


(84.75) 2 
20 


359.13 


Total sum of squares = (2.75) 2 + (2.45) 2 +.(7.70) 2 - 359.13 

= 94.48 


Feeds sum of squares = 

17 752 + 20352 + jlZgl -359.13 = 65.59 
5 5 5 5 

Error S.S. = 94.48-65.59 = 28.89 


Anova Table 


Source 

d.f. 

S.S. 

M.S. 

F 


Feeds 

3 

65.59 

21.86 

12.1 


Error 

16 

28.89 

1.81 



Total/ 

19 

94.48 





The table F 3> 16 0 . 5 = 3.24. As the calculated F value exceeds the 
table value, we conclude that the feeds produce different gains in 
weights at 0.05 level of significance. 
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Multiple Comparison Tests 

Least significant difference 

Isd or CD value = (t n . s , oJ2 ). 

The Isd value at 0.05 level of significance for comparing the 
difference of means of weights for any two feeds is 2.12 
V2x 1.81/5 =1.80. 

The difference in means for feeds A and D have absolute value 
4.95 which exceeds the Isd value, we conclude that feeds A and D 
produce significantly different gain in weights in baby chicks at 0.05 
level of significance. 

Scheffe’s Multiple Comparison Test 

Two means will differ significantly if 

1 | 77 77 

Vi - y 1 1 > «s -1) F s _,. n . so } * Ims. -+— 

\ l n i n ij 

In our example RHS = (3 x 3.27 )* _ 2 .65 

Thus the feeds A and D significantly differ at 0.05 level of 
significance. 

Dunnett’s Multiple Comparison Test: Here two means differ 
significantly if |y t - Yj| exceed t'„ J 2MS e / r where t’a is the value 

from Dunnett’s table for two sided comparisons. 

Here t'„ V2MS e /r = 2.67 x 

Thus the feeds A and D differ significantly at 0.05 level of 
significance. 
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Duncan’s Multiple Range Test 

For applying Duncan’s Multiple range test, we have 
>/MSe/r = Vi .8/5 = 0.6 and d s ,i6,.05 = 3.23, 3.15 and 3.00 respectively 
for s = 4, 3 and 2. Thus D 4 = 1.938, D 3 = 1.89 and D 2 = 1.8. 

If the treatment means are arranged in descending order, they 
correspond to feeds D, C, B and A. All the four means are 
significantly different because the difference of means of feeds D and 
A is 4.95 which exceed D 4 . Now the three means of A, B and C are 
not different because the difference in means of feeds C and A is 
1.88, smaller than D 3 . Again the means of B, C and D are 
significantly different as the difference of means of feed D and B is 
3.99 exceeding D 3 . Thus feed D gives significant results than other 
feeds, while feeds A, B and C do not differ much. This state of affairs 
will be represented by D CBA showing that feeds A, B and C are at 
par, while feed D differ from them. 

Partitioning of sum of squares through contrasts 

Some times it becomes necessary to partition the S.S. due to 
treatment with (s-1) degree of freedom into components with single 
or multiple degree of freedom with meaningful interpretations. This 
we do with the help of contrasts. 

Let Ti, T 2 . T s are the treatment totals each based on r 

replications. 

Then a linear function of T^ T 2 .T s 

Z=I 1 T 1 +I 2 T 2 +.+ l s T s is called a contrast if llj=0 

SS due to contrasts Z=Z 2 /I rjl 2 when the no. of replications for 
each treatment is same. 

If each treatment totals are based upon r* replication then Z=IljTj 
is a contrast if Ir^ = 0 and SS due to Z=Z 2 /Srjl 2 . 

Orthogonal contrasts: Two contrasts Z 1 and Z 2 where 

z, = l,T, + l 2 T 2 .+l s Ts, here £*£'' f jp / . 
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Z 2 = qJi +q 2 T 2 +.+ q s T s . here X qj =•• o 

i-1 

are said to be orthogonal if I I* = 0 if treatment means are 
based on equal number of replications and I n lj qj = 0 if treatment 
means are based on unequal number of replications. 

If we have s treatments the there exist (s-1) mutually orthogonal 
contrasts, each contrast carries one degree of freedom. 

s-1 

Then treatments sum of square = X 55 due to Zj 

i=l 

where Zj’s are mutually orthogonal contrasts. 

In our example, we have 

T, = 10.95, T 2 = 17.75, T 3 = 20.35, T 4 = 35.70 

We have three mutually orthogonal contrasts. 

Z s = - 3T 1 -T 2 +T 3 +3T 4 =-3 x 10.95 - 17.75 + 20.35 + 3 x 35.70 = 76.85 

Z 2 =T i -T 2-T 3+T 4 = 8.55 

Z 3 = -T 1 +3T 2 -3T 3 +T 4 =16.95 

S.S. due to Zi = (76.85) 2 /100 = 59.06 

S.S. due to Z 2 = (8.55) 2 /20 = 3.65 

S.S. due to Z 3 = (16.95) 2 /100 = 2.87 

Here treatment SS = 65.59 at 3 degree of freedom. 

3 

Here we see that treatments SS = X SS due t0 z i 

i=l 

If the feeds A, B C are of one type and feed D be of another type. 
Then the SS due to feeds with 3 degree of freedom can be 
partitioned into 2 components (i) S.S. due to feeds with in the first 
type with 2 d.f. and (ii) S.S. between the types with 1 d.f. 

S.S. due to feeds with in first type 

-T^T, 2 ^ 2 [T 1+ T 2+ T 3 ] 2 

U r 2 r 3 r 1+ r 2 +l 3 

=[(10.95) 2 + (17.75) 2 + (20.35) 2 ]/5-(49.05) 2 /15 = 9.42 



34 


Experimental Designing and Data Analysis 


Comparison between the types of feed, we consider 
2 = T 1 +T 2 +T3“3T 4 = 10.95 + 17.75 + 20.35-3 x 35.70 = 58.05 

(10.95 +17.75 + 20.35-3x35.75) 2 „ 

q q due to Z = —---— =56.16 

b.b. due to z. 5(1 + 1 + 1 + 9) 

The total of these two sums of squares adds upto the feed sum 
of squares. 


Example 

Consider the following data 


Treatment (varieties) 



1 

2 

3 

4 


45 

35 

34 

41 


46 

33 

34 

41 


49 


35 

44 


44 


34 

43 




33 

41 





42 





44 





41 





41 

Total 

184 

68 

170 

378 

Means 

46 

34 

34 

42 

Here n = 4, 

r 2 — 2, 

= 5, T4 — 9, 

GT = 800 



Suppose varieties 1 and 2 are from one origin and 3 and 4 from 
other origin. It would be desirable to have comparisons of 

(i) Varieties 1 and 2 with 3 and 4 

(ii) Variety 1 with 2 

(iii) Variety 3 and 4 

Solution 


Then desired contrasts will be 


Analysis of* Variance^ and Tests for Comparison 

Z, = 7(T 1+ T 2 ) + (-3)(T 3 +T 4 ) 

= 7T t +7T 2 -3T 3 -3T 4 
Z 2 = 1T 1 + (-2)T 2 + 0T 3 +0T 4 =T 1 -2T 2 
Z3 = OT! + 0T 2 + 9T 3 + (-5) T 4 


Zp = 


= 9T 3 - 5T 4 

Here the treatment totals depends upon unequal no. of 
observations. In the first contrast we want to compare the mean of 4 
+ 2 = 6 observations with mean of 5 + 9 = 14 observations. So it is 
necessary to have some weighing for comparison of treatment totals 
based on unequal number of observations. Since the smallest integer 
may be divided evenly by both 6 and 14 is 42, we see that 7 and 3 
are the indicated weights to be used. In the same way coefficient of 
Z 2 and Z 3 are found out and it should be verified that all those three 
contrast should be mutually orthogonal to each other. Then only the 
total treatment sum of square will be equal to the total of the sum of 
square due to contrasts. 

The S.S. for these contrast can be obtained as under. 

S.S. (Z )- [7(184) +7(68)+ (-3)(170)+ (-3)(378)] 2 ni - 
' (4(7 2 ) + 2(7 2 ) + 5(-3) 2 +9(-3) 2 ] 

S s. (Z ) _ [0)0 84) + (~2)(68) + (0)(170) + 0(378)1 2 
2 [4(1 2 ) + 2(-2) 2 +5(0) 2 +9(0) 2 ] 

S S (Z - [(0)(184) + (0(68) + 9(170) + (-5)(378)1 2 _ 

[4(0 2 ) + 2(0) 2 +5(9) 2 + 9(-5) 2 ] 


The above S.S. can also be obtained as under 

SS (Z,) = (184 + 68) 2 /16 + (170 + 378) 2 /14 - (800) 2 /20 =34.3 

SS (Z 2 ) = (184) 2 /4 + ( 68) 2 /2 - (252) 2 /16 = 192.o" 

SS (Z3)= (170) 2 /5 + (378) 2 / 9 -(548) 2 /14 = 205.7 
Now ANOVA table can be made as under: 
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ANOVA Table 


Source of variation 

d.f. 

SS 

MS 

F 

T reatments 

3 

432 

144 

72 

1 and 2 vs 3 and 4 

1 

34.3 

34.7 

17.15 

1 Vs 2 

1 

192.0 

192.0 

96.0 

3 Vs 4 

1 

205.7 

205.7 

102.85 

Error 

16 

32 

2.0 



We find all above difference are significant 


2. Random Effects Model (Variance Component Model): 

One way classification: Let the s levels of a factor be chosen 
randomly from all possible levels of that factor and let a, be the effect 
of the ith chosen level. Since the experimented levels are randomly 
selected, the experimenter will be mainly interested to test the 
equality of the effects for different levels in the population rather than 
the equality of effects of the tested levels. Since the equality of as 
implies that the variance of a s in the population o\ is zero, with a 

random effects model the null hypothesis to be tested will be H 0 = 

= 0 . 


Let the s selected levels of factor A be experimented on n units, 

he ith level being randomly applied to rij units for i = 1, 2, ., s 

( s 




I 

i=l 


n, = n 


. The model assumed will be. 




Yij =p+ aj + ejj, j =1,2,.n f ; i =1,2,....: .,s 

where yjj is the observation on the jth unit to which the ith level of 
A is applied, ji is the general mean, a, is the effect of the ith level of 
factor A and e^s are random errors. It is assumed that aj’s are 
normally and independently distributed with mean zero and variance 
; ej/s are normally and independently distributed with mean zero 

and variance o\ ; aj’s are independently distributed with e^s. The 

ANOVA will be formed as in the case of fixed effect model. The 
average value of mean square for all sampling variations in a's and 
s will be called the expected value of mean squares and will be 
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denoted by E(M.3.) for a one way classification as under: 


Anova Table for a Random Effects model for one way 
classification. 


Source d.f. 


S.S. 


M.S. 


E(M.S.) 


Factor 

A 

Error 


Total 


s-1 


n-s 


n-1 


Y 2 

lA- 

ni 


Y' 


n 


By 

subtraction 


z yij - y2 - 


U 


n 


MS, 


MS e 


2 1 
at + — 


n - 


in* 

n 


2 \ 


Pa 


j 


a; 





ms a -ms, 


1 

s -1 


v n 

^ I 

n-— 

n 


\ 

J 


,x ) n i . , * 2 MS a - MS P 

If n ( s are all equal to r, say, then a £ = ---— under the 

r 

null hypothesis H 0 : <j\ = 0, MS A and MS e will both independently 
estimate a 2 and this leads to the following: 

If MS A /MS e > Fg.!, n-s, a, we reject the null hypothesis H 0 : a A = 0 
and conclude that a A ^ 0 at a level of significance; otherwise not. 


If the feeds of the data of earlier example were randomly taken 
from a populations of feeds with g a as the variance of the effects of 

the feeds, then a A will be unbiasedly estimated by 



21 . 86 - 1.81 

5 


4.01 


One unhappy situation that arises in the variance component 
estimation is that one may get negative estimates for variances and 
in such cases, one may conclude that there is stronger evidence for 
the corresponding population variances to be closer to zero. 

□ a □ 
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Transformation of Data 


The interpretation of data based on analysis of variance 
(ANOVA) is valid only when the following assumptions are satisfied. 

1. Additive effects: Treatment effects and the block effect are 
additive. 

2. Independence of Errors: Experimental errors are independent. 

3. Homogeneity of variance; Observations have common variance. 

4. Normal Distribution: Character under study follows normal 
distribution. 

Also the statistical tests t, F, Z are valid under the assumption of 
independence of errors and normality of characters under study. 

There are certain types of data where all these assumptions are 
not satisfied. For example, when the data are in the form of 
percentage say, seeds that germinate in a plot, the number of rare 
insects present in soil samples or the number of particular types of 
impurities in milk sample, the assumption that the variance is 
constant for the different observations, does not hold good. In the 
first case, the observations have Binomial distribution and hence the 
variance which depends upon the unknown percentage is not a 
constant. In other cases, the observations have a Poisson 
distribution. In such cases, a non-linear transformation of the 
observations satisfies all the above assumptions. For example 
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Model 

Additive 

Multiplicative 

Log 10 X 

Block 

1 

2 

1 

2 

1 

2 

Treatment -1 

10 

20 

10 

20 

1 | 

1.30 

T reatment - 2 

30 

40 

30 

60 

1.48 

1.78 

Treatment effect 

20 

20 

20 

40 




It is seen with the help of the log transformation, multiplicative 
model can be changed into additive model. Another hypothetical set 
of data with multiplicative effects of treatment and replication is_ 


Treatment 

Replication 

Replication effect 


1 

II 

Ml 

100 (1-11) 

/II 

A 

200 

(2.30103) 

125 

(2.09691) 


60 

B 

160 

(2.20412) 

100 

(2.0000) 

60 

(0.20412) 

60 

Treatment 
effect (A-B) 

40 

(0.09691) 

25 

(0.09691) 



100(A-B)/B 

25 

25 




In this case the treatment effect is not constant over replication 
and the replication effect is also not constant over treatments. 
However, when both treatment effect and replication effect are 
present in percentages, their effects become additive with the 
logarithmic transformation y = logx and the replication and treatment 
effect becomes additive. Logarithmic transformed values are given in 
bracket. 

Types of Transformation 
Square root transformation 

Firstly, mean and variance for each treatment may be worked out 
and graph between means and variances may be drawn. In case we 
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get a straight line we conclude that variance is varying with mean. 
Here the original data follows Poisson distribution where the mean 
and variance are equal, so the square root transformation is useful in 
bringing the original distribution into normal distribution. 

For example, the number of plants infected with a particular 
disease in a given area, the number of insects of a particular disease 
in a given area, the number of weed seeds in a sample of seed, the 
bacterial colonies on a plate count etc. follow Poisson distribution. In 
this case Jy transformation is recommended for y, when the original 

values lies between 0 to 30 or 70 to 100. However when th e origina l 

i- n _ \ 


data consists of zeros then 



V(v +1 ) 


transformation may be used. Here the square root transformation 
approximates Poisson to Normal distribution. 

Angular Transformation (Sine Inverse Transformation, or Arc 
Sine Transformation) 


When data relate to Binomial distribution, mean is proportional to 
variance, angular transformation (also termed as sine inverse 
transformation) as given by the equation. 0 = sin 1 X may be used to 
stabilize the variance and transformed data follows Normal 
distribution. Here 9 is in degrees and X is the percentage value. 
Angles corresponding to percentage, Angle = ARC SINE 
^Percentage as given by C.l. BLISS are given in Fisher and Yates 

statistical tables. When all the percentages lies between 30 to 70 the 


data are expected to follow normal distribution and hence no 
transformation is required. The zero percent should be substituted by 

( O 


(1/4n) and 100 by 


100 - 

4n 


where n is the no. of units upon which 


percentage of data was based. 


Here not all percentage data need to be transformed and even if 
they do Arc Sine transformation is not the only transformation 
possible. The following rules may be useful in choosing the proper 
transformation, scale for percentage data derived from count data. 
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Rule 1. The percentage data lying between the range of 30 to 
70% is homogeneous and no transformation is needed 

Rule 2. For percentage data lying within the range of either 0 to 
30% or 70 to 100% but not both, the square root transformation 
should be used. 

Rule 3. For percentage that do not follow the ranges specified in 
Rule 1 or Rule 2, the Arc Sine transformation should be used. * 

Logarithmic Transformation 

When mean is proportional to standard deviation, or in other 
words, if the coefficient of variation is constant, the transformation y= 
log x is used to stabilize the variance. If zero occurs or data sheet 

( 3 ^ 

involves small values (e.g. less than 10) log (y+1) or log y h— is 

v 8 

used. This transformation is particularly useful when the observed 
data consists of big integers such as Index numbers or biological 
population or the population following multiplicative model for 
example numbers of insects per plant, number of egg masses per 
plant or number of leaving larvae on rice plants treated with various 
rates of an insecticides are typical examples. 

This transformation would make errors normal when 
observations follow negative binomial distribution. 

When the data relates to proportions we can use the 
transformation y=log x/(1-x) when x represents the proportion. For 
example, A, denotes the area of a plot affected by some disease and 
A is the total area of the plot then x = A^A (proportion of the area 
affected) and will lie between 0 and 1. 

Reciprocal transformation 

When variances differs from treatment to treatment, if time is 
characteristic under study, reciprocal transformation converting the 
original values to reciprocals, would bring stabilization to the 
variances of the treatments. 
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Probit Transformation 

This transformation is often used in the analysis of dose- 
response relationship when the response variable is a binomial 

proportion -. It is useful when, for example, the threshold of 
n 

response is normally distributed in the test population because in this 
case it reduces the dose response relationship to a straight line. The 

probit corresponding to r/n is defined as (X+5) where X is a 

r 

standardized normal deviate such that <j>(X)=—. The constant 5 is 

n 

added so that the probit is always positive. Thus of r=7, n = 8, then 

-=0.875, from table it is seen that for x = 1.15, <|)(X) = 0.875. Hence 
n 

the probit corresponding — = 6.75. Table giving the probit 

transformation directly are available. 

ANOVA table is constructed as usual manner as the transformed 
data. For overall comparison of means, F test is done as usual with 
the transformed data. Similarly, for pairwise comparison of means, 
LSD or DMRT are done with the transformed data. Of course, the 
final presentation of the treatment means in the original scale, they 
may either be computed from the original values or they may be 
obtained by converting the transformed values to the original scale. 

In certain cases it may so happen that the ranking of means 
scale and the transformed scale are not same. In such situations, 
computation of means by converting the transformed data is more 
appropriate. 

Transformation affects the apparent importance of interactions. 
In some cases, while un-transformed data show no existence of 
interactions, the transformed data may show existence of interaction. 
In such cases, the interpretation becomes difficult and analysis of 
data with the original scale is rather better. 

Example 

The data on percentage of insect survival in 12 rice varieties trial 
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in a completely randomised design with three replications is given 
below. For each plant out of so cased insets were different of 
surviving percentage. Analyze the data using appropriate 
transformation. 


Percentage Survival 


IE5SS9I 

mmm 




Mean. 

Vi 

37.00 

25.33 

47.00 

109.33 

36.44 

v 2 

21.33 

47.32 

81.00 

149.65 

49.88 

v 3 

0.00 

0.00 

0.00 

0.00 

0.00 

v 4 

25.32 

27.88 

51.10 

104.30 

34.77 

v 5 

23.89 

27.32 

52.33 

103.54 

34.51 

V 6 

0.00 

0.00 

21.00 

21.01 

7.00 

V 7 

32.82 

mum 

25.89 

87.38 

29.13 

V 8 

0.00 

0.00 

0.00 

0.00 

0.00 

V 9 

18.89 

35.48 

12.75 

66.79 

22.26 

Vio 

95.89 

100.00 

100.00 

295.89 

98.63 

Vn 

13.56 

32.34 

38.58 

84.48 

28.16 

v 12 

44.33 

48.66 

12.15 

105.14 

35.05 


Solution: 

The arc sine transformation should be used because the 
percentage data range from 0 to 100%.Before transformation all zero 

r . -i 


per cent values are replaced by —i.e. 

A 


1 .1 


4n |_4x75 l J 


and all 100 per cent 


values by 


f 


100 




4n 


i.e. [100-1/4(75)]. The transformed data and the 


analysis of variance is given below. LSD test and the DMRT were 
first applied to the transformed means and then transformed to the 
original means. 
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Transformation of data using Arc sine Transformation: 


Variety 

Transformed values 

Total 

Mean 

Rep-I 

Rep-ll 

Rep-Ill 

Vi 

37.46 


43.28 

110.96 


v 2 

27.51 


64.16 

135.13 


V 3 

0.33 

0.33 

0.33 

0.99 

EEEHB 

V 4 

30.21 

31.87 

45.63 

107.71 

35.90 

V 5 

29.26 

31.51 

46.34 

107.11 

35.70 

V 6 

0.33 

0.33 

27.27 

27.94 

IB 

V 7 

34.95 

32.37 

30.59 

97.91 

32.64 

V e 

0.33 

0.33 

0.33 

0.99 

0.33 

V 9 

25.52 

36.56 

20.92 

83.00 

27.67 

Vio 

78.30 

89.67 

89.67 

257.64 

85.88 

Vn 

24.61 

34.66 

38.40 

94.66 

31.55 

v 12 

41.74 

44.23 

20.40 

106.38 

35.46 


Analysis of variance table 


Source of 
variation 

Degree of 
freedom 

! |2ffijBE3| 

Mean 

squares 

Computed 

F 

Variety 

11 

17024.13 

1547.65 

16.26 

Error 

24 

2284.09 

95.17 


Total 

35 

19308.22 

551.66 



** Significant at 1 % level. 


, ^ ^ (2x95.17 

LSD 05 — 2.03 x y-—- = 18.32 

lpn /2x95.17 _ . 

LSD.oi=2.73 x J---= 21.74 

These LSD values are in the transformed scale and must be 
applied to the transformed means. For example to compare first and 
second variety, the difference of the transformed means is 45.04 - 
36.99 = 8.05 which is less than the LSD value 18.32 at 5 % level of 
significance.Hence variety 1 and variety 2 do not differ significantly. 

□ □ □ 































































Chapter - 4 


Single Factor Experiments 


Experiments in which only a single factor varies while all others 
are kept constant are called single factor experiments. For example 
(i) Fertilizer trials where several rates of single fertilizer element are 
tested, (ii) Insecticides trials where several insecticides are tested, 
(iii) Plant population trials where several plant densities are tested. 

There are two groups of experimental situations that are 
applicable to a single factor experiment. One group is the family of 
complete block designs which is suited for treatments with a small 
number of treatments and is characterized by blocks, each of which 
contains at least one complete set of treatments. The other group is 
the family of incomplete blocks which is suited for experiments with a 
large number of treatments and is characterized by blocks, each of 
which contains only a fraction of the treatments to be tested. 

Hence three complete block designs namely Completely 
Randomized, Randomized Complete Block and Latin Square Design 
will be discussed. The other two incomplete block designs namely 
Balanced Incomplete Block Design and Lattice Design will be 
discussed in the later chapters. 

Completely Randomized Design (CRD) 

Designs are usually characterized by nature of grouping of 
experimental units and the procedure of random allocation of 
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treatment to the experimental units. In a completely randomized 
design the units are taken in a single group and as far as possible the 
unit forming the group should be homogenous in nature. For example 
all the field plot constituting the group are having the same soil 
fertility, soil depth, soil texture, soil moisture, etc., all the cows 
forming the group are of the same breed, same age, same weight, 
same lactation etc. Suppose there are v treatments and the i ,h 

treatment is replicated r* times thus we require rj = n 

■if i - 1 

experimental units. Allote v treatment randomly to n experimental 
units such that the i th treatment occupies r, units or plots. Such a 
design is called CRD. This design is also seldom called a non- 
restrictional design. CRD is most useful for laboratory experiments, 
pot experiments, green house experiments, and sometimes for 
poultry and animal experiments. 

Layout 

Suppose there are four treatments T 1f T 2 , T 3 and T 4 which are 
replicated 4, 3, 3 and 5 times respectively. For layout in CRD the first 
treatment will be allotted at random to 4 units, second and third 
treatment to 3 units and fourth treatment to 5 units of 15 experimental 
units. 


Field plan of CRD with treatments assigned to plots. 


t 2 (1) 

T 2 (6)_ 

Jz (ID 

Ts (2) _ 

T4 (7) 

_Ta (12) 

T„ (3) 

T 4 (8) 

Ti (13) 

T, (4) 

Ti O) 

T, (14) 

Ta (5) 

T„ (10) 

T 4 (15) 


Merits of CRD 

(i) Its layout is very simple. 

(ii) There is complete flexibility in this design i.e. any number of 
treatments and replicates for each treatment can be taken. 
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(iii) Whole experimental material can be utilized in this design. 

(iv) The analysis is not complicated by missing data. 

(v) The statistical analysis is not complicated by unequal 
replication of treatments. This is not the case with more 
complex designs. 

(vi) The design offers maximum error degree of freedom. No 
other design with the same total number of plots and 
treatments provides more error degree of freedom than does 
the CRD. 

Demerits of CRD 

CRD suffers from many lacunae which are as follows 

(i) The design is suitable for a small number of treatments. 

(ii) It is difficult to find homogenous experimental units in all 
respect. 

(iii) It is seldom suitable for field experiments as compared to 
other experimental designs. 

Analysis 

This design provides a one way classified data according to the 
levels of a single factor. For its analysis following model is used 


Yjj = p + tj + ey i = 1, 2,.v; j = 1, 2.n 

Set up a table of observations and compute the treatment totals. 


Treatments 





Total 

Mean 

1 

Yn 

Y 2 i 


Yih 

Ti 

7i 

2 

Yi 2 

CM 

CM 

> 


Y 2 r 2 

t 2 

y 2 















V 

Y v i 

CM 

> 


Y v r v 

T v 

7r 

Total 





G 

V 


where is the observation obtained from the j ,h replicate of the 
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i th treatment, ju is the general mean, t f is the fixed effect of the i th 
treatment and ejj is the error component which is random variable 
assumed to be normally and independently distributed with zero 
mean and a constant variance a 2 . 

Here = Tj = Sum of the observations of the i fh treatment 
j=i 

y, = T/rj = mean of the i ,h treatment 

V 

G = ^Tj = grand total of all the observation 

i=1 

y = G/n = grand mean; here n is the total no. of observation. 

Compute correction factor 

CF = G 2 /n 

Then total sum of squares (SSTOT) = Z2>f- CF 

i j 

v 

Treatment sum of squares (SST) = ^ Tj 2 /^ - CF 

i=1 


Error sum of squares (SSE) = Total SS - Treatment SS 

ANOVA 


Source 

d f. 

SS 

M.S.S. 

F 

Treatments 

v-1 

SST 

SST/v-1 =MST 

MST/MSE-F[v-1, 

(n-v)] 

Error 

(n-v) 

SSE 

SSE/(n-v) 

MSE 


Total 

n-1 


SSOT 



Where MST and MSE are the mean squares for treatments and 
error respectively. If the calculated vale of F is greater than the table 
value [a, (v-1), n-v] where a denotes the level of significance. The 

Hypothesis H 0 : = p 2 .= p v (the true mean of all treatments are 

the same) against H 0 : Mi * p 2 . ^ p v (at least one mean differs 
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from the others) is rejected and it can be inferred that any two 
treatment effects are significantly different from each other. 


SE of any treatment mean say i th treatment mean is 


SE m 


Imse 

V 


SE of difference between any two treatment means is 


SE d = Imse 


v r i +r U 


where n and n are the number of 


replications for the i th and j Tn treatment. 


:th 


If n = q = r then 


SEh = 


'2MSE 


The critical difference (C.D.) can be calculated as 
C.D. = SE d x t( a> n . v ) 

Where t (( , i n . v) is the two tailed value of‘t’ at a level of significance 
and at (n-v) error degree of freedom. 

Means are arranged in ascending or descending order of 
magnitude for comparison. Any two treatments means are said to 
differ significantly if there difference is larger than the critical 
difference. 


Example 

The yield of Lentill (Kg/plot) from 20 demonstration plots on 
various soil types is given below. Use CRD for analysis to see if there 
were yield differences due to soil type. Lentill seed yields (Kg/plot) 
from 20 demonstration plots on various soil types are given in 
following table. 

Analysis 

CF = G 2 /n = (546.9) 2 /20 = 14954.98 
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Total sum of squares = II Y iV CF 

i J' 

= (41.3) 2 + (33.7) 2 .+ (29.5) 2 - CF 

= 16217.25 - 14954.98 = 1262.27 


Soil type 


T reat- 

ments 

1 

2 

3 

4 

5 

H 


41.3 

29.2 

19.7 

41 

31.5 

n 


33.7 

27.5 

20.6 

36.9 

25.7 



28.9 

23.4 

14.1 

32.0 

29.5 




20.3 

11 

32.8 





19.9 


27.9 



Sum 

103.9 

120.3 

65.4 

170.6 

86.7 

546.9 

n 

3 

5 

4 

5 

3 

20 

Mean 

34.63 

24.06 

16.35 

34.12 

28.90 | 

27.34 


Treatment sum of square = — 1 —CF 

n 

(103.9) 2 , , (86.7) 2 cp 

3 . 3 

= 15888.61 -14954.98 = 933.63 

Sum of square due to Error 
= Total SS - Treat SS 

= 1262.27-933.63=328.64 


ANOVA Table 


Source 

d.f. 

SS 

MS 

F 

Soil type 

4 

933.63 

233.41 

10.65 

Error 

15 

328.64 

21.91 


Total 

19 

1269.27 
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Significant at the 1 % level 
The Table value of F 01 (4,15) = 4.8932 < 10.65 
which means the treatments differ significantly 

Standard Error of mean = / MSE 

n 


SE v t = 


'21.91 


= 2.70 


SE y 2 = 


21.91 


= 2.09 


SE; 


'21.91 


= 2.34 


SE; 


'21.91 


= 2.09 


SE Vs = 


'21.91 


= 2.70 


SE of difference between type 1 and type 2 soil. 


SE & ,-y z) = JMSE 


'l + l' 


f 


= .21.91 


1 1 

-1- 

3 5 


\ 


= 3.42 


SE ( y ,- y 3 ) = -' 21 ’ 91 


ft 1 
—+ — 
3 4 


= 3.57 


f 


SEft-y 4) = J21.91 


1 1 

-1- 

3 5 




= 3.42 


SE (yi-y 5 ) =J21.91 


(1 1 
3 + 3 


\ 


= 3.82 


SE (y 2 - y 3 ) = J21.91 


- 1 = 3.14 
[5 4 
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(ya-yj 


(y 2 -v 5 ) 


(y 3 --y 4 ) 


(y 3 -y 5 ) 


SE 


(y 4 -y 5 ) 



Mr 

,5 + 5, 

J 21 - 9 ' 

rir 

U + 5j 

| 2 '- 9 ' 

nr 
U + sJ 

J 2 ' 91 

nr 

,4 + 5, 

J 21 ' 9 ' 

Mr 

<S + 5j 


= 2.96 


= 3.42 


= 3.14 


= 3.58 


= 3.42 


C.D. values for comparing the different pairs of treatment can be 
calculated. For example for comparing y t , with y 2 we had 


CD at 1% level of significance = SE ^ ^ x t ( 0115) 

= 3.42x2.947= 10.08 

|y t - y 2 1 = |34.63 - 24.06| = 10.57 > 1.55 So the means y x and 
y 2 differ significantly. In the same way CD for all the pairs can be 
calculated at the desired level of significance. 

Analysis of yield data from 20 demonstration plots indicates that 
there are highly significant differences in yield among the five soil 
types represented in the trial. The results are presented in the 
following table. 


Mean yield (kg/plot) of wheat grown on five soil types 



1 

2 

3 

4 

5 

Mean yield 

34.63 

24.06 

16.35 

34.12 

28.90 

Replication 

3 

5 

4 

5 

3 

Standard 

Error 

2.70 

2.09 

2.34 

2.09 

2.70 


It is apparent that soil type-1 produce the highest yield of lentil 
seed. Soil type 3 is clearly inferior to others. The soil types can be 
compared with the CD values. 


k 
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Randomized Block Design 

The Randomized block design (RBD) or more, correctly the 
randomized complete block design is the design used most often in 
agricultural research. When the experimental material is not entirely 
homogeneous, we divide the material in to homogeneous groups of 
the experimental units by adopting the principle of local control. The 
object is to make the variation from plot to plot as small as possible 
with in the blocks while maximizing the variation among blocks. 
Changes in technique or in other condition that might affect the 
results should be made between blocks. The number of blocks in the 
design is the same as the number of replications. 

Suppose we want to compare the effect of v treatment, each 
treatment being replicated an equal number of times say ‘r\ We need 
n=rt experimental units. The first step is to divide the units in to r 
homogeneous blocks each consisting of v experimental units. Then 
the second step is to allot the treatments randomly to the units of the 
block. The randomization is to be done afresh for each block. 

For actual layout in the field, blocks are formed perpendicular to 
the direction of fertility gradient in the field and then blots with in each 
block are formed in the direction of fertility gradient. All block and 
plots must have the same size. 

Layout of RBD 

Let there be five treatments A, B, C, D and E each replicated 5 
times. The experiment area (fertility of the area having a slope from x 
to y) can be divided in to five blocks perpendicular to the direction of 
fertility gradient as shown below and then each block is divided in to 
five plots for the random allotment of the treatments. 

Salient features of RBD 

* 

1. It can remove one source of variation from the experimental 
error and thus increased precision. 

2. By blocking blocks under different conditions it can broaden 
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the scope of the trial. 

3. Any number of treatments and any number of blocks may be 
used. The only restriction is that each treatment must be 
replicated the same number of times in each block. 


Field plan of RBD 



4. Statistical analysis is fairly simple. 


5. Even some treatments are missing in one or more blocks, 
the statistical analysis can be performed with the help of 
missing plot technique. If the numbers of missing units are 
more in any particular block, the statistical analysis can still 
be done by removing the entire block from the analysis. 

6. In clinical or similar trials where animals like rats, guinea pigs 
etc., are the experimental units, animals coming from the 
same litter may form blocks. Characters like age, sex, vigour, 
breed etc., could be considered for forming blocks. Milk of 
the same fat contents would be considered as blocks in dairy 
experiments. Shelves of the germinators could be blocks in 
horticulture experiments, soils of the same type would from 
block in soil science experiments. 

In the green house experiments, if the access of air 
current and sunlight are across the bench then the block 
would be formed along the bench and vice - versa 

7. The design is less efficient than others in the presence of 
more than one source of unwanted variation. 
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Analysis 

The data collected from experiments with randomized block 
design form a two way classification that is, classified according to 
the levels of two factors, viz. blocks and treatment. There are vr cells 
in the two way table with one observation per cell. 

The model used is 

Yjj = p + tj + bj + ejj i = 1,2, ....v; j = 1,2 .... r 

where Yjj denotes the observation for the i th treatment in the i th 
block, p, tj, bj are respectively the general mean, effect of i th treatment 
and effect of j th block. These effects are fixed and is the random 
error component which is distributly normally and independently with 
zero mean and constant variance a 2 . 

The data can be arranged in the following two way classification 
table. 


Blocks 



1 

2 

■ 

j 


r 

Total 

Treatment 

1 

Yn 

Y12 

l 

Yii 

1 

Yi r 

Yi. = 

Ti 

2 

Y 2 i 

y 22 

l 

Y 2j 

1 

Y 2r 

ii 

£ H 

• 



1 


1 



• 





1 



i 

mm 

Y i2 

■ 

JflL_ 


Y ir 

m 

. 



■ 





. 



■ 





V 

Yvi 

Yv2 

l 


1 

Yvr 

Yv. = 
T v 

Hi 

Y.1 

=Bi 



Y.j =Bj 


Y.r = B r 

G 

























56 


Experimental Designing and Data Analysis 


Let Yy = Yj. = Tj(i = 1,2...v) = Total of observation from the i th 
i 

treatment 


and ^ Yy = Y.j =Bj(j = 1,2...r) = Total of observations from j th 


block. 


Further IT; = IBj = G 


Correction factor (CF) = 


rv 


Sum of squares due to Treatments = 


TV 


-CF 


Bi 


Sum of square due to blocks = !—CF 

j v 

Total sum of equal = 2>if-CF 


I.J 


Sum of square due to Error = Total SS-SS due to treatment - SS 
due to blocks 


Here Null hypothesis H 0 : t 1 = t 2 . = t v against the 

alternative that any two t’s are not equal 

Analysis of variance table (ANOVA) 


Source 

d.f. 

S.S. 

MS 

F 

Blocks 

r-1 

SSB = _ Bi 2 

y —— cf 

i v 

MSB = SSB/(r-1‘) 

MSB ~F[( r .i), (r-i) ( V .i)] 
MSE 

Treatment 

v-1 

SST = ^. CF 

r 

MST = SST/(v-1) 

MST - ~F[(v-1). (r-1) (v-1)) 

MSE 

Error 

(r-1) (v-1) 

SSE= 

By subtraction 

MSE=SSE/(v-1)(r-1) 


Total 

vr-1 

I2>«- CF 

i i 




H F ca i > F[<x; (v-1), (v-1) (r-1)] then H 0 is rejected for treatment effects. 
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1. The Standard Error of treatment means SE m = 

2. The standard error of difference between any two treatment 
means 

SE,- jiff 

Critical difference at a level of significance 
CD = SE d x [t a , (r-1) (v-1)]. 

If the difference of any two treatment means is greater than the 
CD value, the corresponding treatment effects are significantly 
different. 

Example 

An agronomist wanted to determine the effect of different 
sources of nitrogen on the yield of barely used as a forage crop. 
There are five sources of nitrogen. 

(1) (NH 4 ) 2 S0 4 (2) NH 4 H0 3 (3) CO (NH 2 ) 2 (4) Ca (N0 3 ) 2 and (5) 
NAN0 3 . He also decided to use (6) control treatment with no 
nitrogen. The field plan and yields (kg/plot) are given below. Analyze 
the data using RBD 


Soil type 

II III IV 


■ 

(5) 

27.2 

(1) 

36.5 

(6) 

27.2 

(4) 

32.3 

(3) 

26.3 

(2) 

30.7 

(1) 

40.7 

(2) 

31.8 

(6) 

23.2 

(6) 

24.1 

(5) 

32.8 

(6) 

25.9 

(2) 

30.6 

(5) 

32.0 

(4) 

36.5 

(D 

36.3 

(4) 

25.4 

(3) 

28.3 

(3) 

33.1 

(5) 1 

32.2 

(1) 

31.5 

(4) 

32.1 

(2) 

38.5 

(3) 

32.1 
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Analysis 


Prepare two way table of yield (kg/plot) of Forage barley under 
various sources of nitrogen. 


Treatment 

Soil type 

Total 

Mean 


1 

II 

III 

IV 



1 

31.5 

36.5 

40.7 

36.3 

145.0 

36.25 

2 

30.6 

30.7 

38.5 

31.8 

131.6 

32.90 

3 

26.3 

28.3 

33.1 

32.1 

119.8 

29.95 

4 

25.4 

32.1 

36.5 

32.3 

126.3 | 

31.57 

5 

27.2 

31.0 

32.8 

32.2 

124.2 

31.05 

6 

23.2 

24.8 

27.2 

25.9 

100.4 

25.10 

Total 

164.2 

183.3 

208.8 

190.6 

747.3 


Mean 

27.37 

30.53 

34.80 

31.77 


31.14 


Computations 


CF = 



rv 


(747.3) 2 

24 


23269.05 


Sum of squares due to treatment = 


it ; 2 

r 


-CF 


[(145.0) 2 +.. + (100.4) 2 ] cp 
4 

= 23538.27-23269.05 = 269.22 

Sum of squares due to blocks = IB 2 /v - CF 

[(164.2) 2 + .. + (190.6) 2 ] cp 
6 

= 23438.85 - 23269.05 = 169.80 

Total sum of square = IIY 2 j - CF 

= (31.5) 2 + (30.62) 2 +....+ (25.9) 2 - CF 

= 23746.95 - 23269.05 = 477.90 

Sum of squares due to Error = Total SS - SS due to treatment - 
SS due to blocks 


477.90 - 269.22 - 169.80 = 38.88 
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Source 

d.f. 

SS 

MS 

F 

Soil (Blocks) 

3 

169.80 

56.60 

21.84** 

Fertilizer 
(T reatment) 

5 

269.22 

53.84 

20.78** 

Error 

15 

38.88 

2.59 


Total 

23 

477.90 




** Significant at 1 percent level. 

Here tabulated F 0 .oi( 5 f i 5 ) = 4.56 Since 20.78 > 4.56 
We conclude that differences among the nitrogen sources 


means are significant at 1% level. 

Again tabulated F 0 .oi( 3 .is) = 5.42 since 21.84 > 5.42 

We conclude that blocking on soil type was effective in reducing 
the experimental error. 

1. Standard Errors 



2. Standard error of a difference between two treatment mean 
^ /2MSE |2x2.59 . . . 

SEd= i-r =>Hr - =1 - 14 


CD at 1 % = SE d xt (.01, 15) 

= 1.14x2.95 = 3.36 

Mean yield (kg/plot) of different treatments. 


Source 

1 

2 

3 

4 

5 

6 

SE m 

CD 1 % 

Mean 

36.3 

31.8 

32.1 

32.3 

32.2. 

25.9 

0.80 

3.36 


Relative efficiency of RBD vs CRD 


Efficiency = 


Amount of information in RBD 
Amount of information in CRD 


Amount of information for any design = 0%±H x _ ] _ 

(n e + 3) Error mean square 
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where n e is the degree of freedom in the design for error. 

If RBD were conducted in CRD i.e. if the blocks were not formed 

than 


Error mean square in CRD = — Eb + ( n j _ +n e) E e 

n b +n ( +n e 


3 x 56.60 + (5 + 15) x 2.59 
3 + 5 + 15 


9.64 


where n bl n t and n e are the degree of freedom of blocks, 
treatments and error respectively and E b and E e are the block and 
error Mean squares 

Amount of information in CRD = - e x-—- 

(n e + 3) Error mean square 


(18 + 1) 1 
(18 + 3) X 9.64 


0.0939 


Amount information in RBD 


( n e+1) x _ 1 

(n e + 3) Error mean square 


16 1 
18 X 2.59 


0.3432 


Efficiency of RBD as compared to CRD 


Amount of information in RBD 
Amount of information in CRD 


0.3432 

0.0939 


x 100 = 365.49% 


Gain in efficiency = 365.49 percent 


Latin Square Design 

Another type of experimental design, which is more restrictive 
then RBD is Latin Square Design (LSD). Here the number of plots is 
the square of the no. of treatments and are normally used in 
experiments to remove the heterogeneity of experimental material in 
two directions. For example in field experiments when the fertility 
variation is in two perpendicular directions or there is diagonal trend 
of fertility or the direction of fertility variation is not known. In animal 
science or other experiments when there are two source of variation 
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which need two way blocking e.g. for the data on milk yield the breed 
and the lactation number are two source of variation. Like wise in 
experiment on increase in weights of pigs, the initial weights and the 
breed of pigs are the two source of variation. 

For testing v treatments v 2 units are divided into v rows and v 
columns and the treatments are allotted in such a way that each 
treatment occurs exactly once in each row and once in each column. 
This arrangement is called latin square. The term v is called order of 
the latin square. There is two way blocking one as row blocks and 
other as column blocks such that each row as well each column is a 
complete replicate. 

Example: 

To randomize a Latin square we start with a basic Latin square. 
Such arrangements -are readily available in the Tables for 
Statisticians and Biometricians (Fisher and Yates, 1948, 1974) for a 
given order. For example we have 5x5 Latin square. 


Columns 


Rows 

1 

2 

3 

4 

5 

1 

A 

B 

C 

D 

E 

2 

B 

C 

D 

E 

A 

3 

C 

1 D 

E 

A 

B 

4 

D 

E 

A 

B 

C 

5 

E 

A 

B 

r C 

D 


Now the rows are randomized 


Columns 


Rows 

1 

2 

3 

4 

5 

1 

A 

B 

C 

D 

E 

2 

B 

C 

D 

E 

A 

5 

r e 

A 

B 

C 

D 

4 

D 

E 

A 

B 

C 

_3_ 

C 

D 

E 

A 

B 


Next columns of the row randomized square are randomized. 
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Columns 


Rows 

5 

2 

3 

1 

4 

1 

E 

B 

C 

A 

D 

2 

A 

C 

D 1 

B 

E 

5 

D 

A 

B 

E 

C 

4 

C 

E 

T 7- 

D 

B 

3 

B 

D 

E 

C 

A 


As a result of row and column randomization, but not the 
randomization of individual units, the whole arrangement remains a 


latin square. 

Advantages 

(1) With a two way grouping this design controls more of the 
variation than CRD and RBD. The two way elimination of 
variation often results in small error mean square 

(2) The analysis is simple 

(3) The analysis remains relatively simple even with missing 
data. Analytical procedures are available for omitting one or 
more treatment, rows or columns. 

(4) Latin square design have a wide variety of application for 
experimental work. They are used in industry, laboratory, 
field, green houses, educational, medical, marketing and 
sociological experiments. 

Disadvantage 

( 1 ) The number of treatment is limited to the umber of rows and 

columns. In general 4 x 4 to 12 x 12 Latin squares should be 

used in practice. For the latin square less than 4x4 order 

the error d.f. would be too small and the number of 

experimental units would be too large for the designs bevond 
12x12 size. y 

( 2 ) Repeated Latin square of the size 2 x 2 or 3 x 3 are found 
useful in certain situations to keep the error degree of 
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Analysis of Latin Square Designs 


In Latin square design there 
from this design are therefore 
data. Here the model is 


are three factors. The data collected 
analyzed as a three way classified 


Yjjk M + H + Cj + t k + e ijk . i, j, k =1,2 .v 

Where y ijk is the observation pertaining to i (h row 
k h treatment. 


j th column and 


n, c j( t k (i,j, k = 1 .... V ) are fixed effects denoting in order the 
general mean, the row, the column, and the treatment effects. The 
ei jk ’s are error component assumed to be independently and normally 
distributed with zero mean and constant variance o 2 . 

The analysis is conducted by following a similar procedure as 
described in RBD. 


Let the data be arranged first in a row x column table such that y, 
denotes the observation of (i,j) th cell of table. 



Columns 


1 2 ■•••i.. 

Total 

1 


Ri 

2 


Rows 



i 

Yi, 

Ri 

__ v 


R v ! 

Total 

O 

O 

o 

< 

V ! 

G 


Let R, = ^ y.. = j th row total (i = 1, 2, ... v) 


Cj = ^ yjj = j th column total (j = 1, 2.. v) 

i 

T k = Sum of those observations which comes from k ,h treatment 
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(k = 1,2... v) 




i=1 k=1 


Correction Factor CF = —=- 

Total sum of squares (TOT SS) = ^ Yj^ - CF 


Treatment sum of squares (SST)= —^—C.F. 


Rows Sum of squares (SSR) = —C.F. 


Column sum of squares (SSC) = —-C.F. 


Analysis of variance of v x v latin square design 


MS 


of 

variation 


Rows 


Error 


Total 


d.f. 

SS 

v-1 

TR 2 

1 CF = SSR 

V 

v-1 

IC 2 

1 CF = SSC 

V 

v-1 

IT 2 

k CF = SST 

V 

(v-1) 

(v-2) 

By subtraction 

(SSE) 

v 2 -1 

X y V-cf 


v -1 


SSC 
v -1 


SST 





The hypothesis of equal treatment effects is tested by F test. F is 


























Single Factor Experiments 


65 


the ratio of treatment mean squares to error mean square. If F is not 
significant treatment effect do not differ significant among 
themselves. If F is significant further studies to test the significance of 
any two treatment means can be made exactly in the same way as in 
RBD 

CD at a level significance = SE d x t [(a, (v-1) (v-2)] 

Efficiency 


Relative efficiency of Latin square design in comparison to with 
RBD and CRD. 

Let the sketch of ANOVA of LSD be 


Source 

d.f. 

MSS 

Rows 

n r 

E r 

Columns 

n c 

Er. 

T reatment 

n, 

E, 

Error 

n e 

E e 


Relative Efficiency of LSD in comparison with RBD 

(1) If row classification had not been made i.e. it the columns 
were treated as blocks, then estimate of error means square 
in RBD 

s 2 _ n r E r +(n t +n e )E e 
2 n r + n t +n e 

Taking LSD as design Dt and RBD as D 2 we can find the 
Relative Efficiency using Fisher’s formula 

D 1 (LSD) D 2 (BDR) 

ni = n e n 2 = (n r + n e ) 

s 1 = E e s 2 = as obtained above. 

The RE of D 1 (LSD) as compared to D 2 (RBD) is 
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2 

PE = ( n i +1 )( n 2 + ^)s 2 x -j oo% and Gain in efficiency 
(n 2 +1)(n 1 +3)s 1 2 

= (RE-100) 

(2) It column classification had not been done i.e. if the rows 
were treated as block then 

Estimate of Error mean square in RBD 
$ 2 _ n c E c +(n t +n e )E e 
n c +n t +n e 

Here 

Di (LSD) D 2 (RBD) 

n i — n e n 2 = n c +n e 

s^ = E e s 2 2 = as above. 

and using Fisher formula as in the previous case we can find 
the RE and gain in efficiency of LSD in comparison with RBD. 

(3) Relative Efficiency of LSD in comparison with CRD when 
both rows and columns are merged then 

Estimate of Error mean square in CRD 

g 2 _ r*r E r +n c E c +(n^ +n e )E e 
2 n r +n c +n t +n e 

Taking 

Di (LSD) D 2 (CRD) 

ni = n e n 2 = n, + n c +n e 

Si = Eg s 2 = as above. 

We can find the RE of LSD in comparison with CRD as 
discussed earlier. 

Example 

A bacteriologist wanted to determine the effect of four different 
sources of inoculums, A, B, C, and D and a control, E, on the dry 
matter yield of a crop. The plots were to be furrow irrigated and the 
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agronomist suspected that there might be a differential effect of 
irrigation from one end of the furrow to the other. The field was also 
bordered by a row of trees and he suspected a differential effect of 
shading as distance from the tree line increased. He decided to use 
square plots in Latin square design with rows across the irrigation 
gradient and columns across the shade gradient. The field plan and 


total dry matter yields (kg/plot) are given below. 


A 

B 

• yj 

D 

C 

E 

32.8 

32.9 

31.6 

33.5 

25.4 

D 

E 

B 

A 

C 

36.4 

29.8 

32.6 

33.1 

34.3 

C 

D 

A 

E 

B 

35.3 

36.5 

37.2 

27.6 

34.6 

E 

A 

1 C 

B 

D 

34.2 

37.8 

36.8 

38.0 

35.2 

B 

C 

D 

E 

A 

33.8 

38.7 

32.6 

37.6 

35.9 


Field plan and dry matter yields (kg/plot) for an inoculum source trial. 


Computation 


Row-by- column summary table of data from a Latin square 
design 





Column 

Row 

1 

2 

3 

4 

5 


1 

32.8 

32.9 

31.6 

33.5 

25.4 

mmm 

2 

36.4 

29.8 

32.6 

33.1 

34.3 

166.2 

3 

35.3 

36.5 

37.2 

27.6 

34.6 

171.2 

4 

34.2 

37.8 

36.8 

38.0 

35.2 

182.0 

5 

33.8 

38.7 

32.6 

37.6 

35.9 

178.6 

Sum 

172.5 

175.7 

170.8 

169.8 

165.4 

854.2 


1. Sums of squares: 
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CF= ^- = ( 854 - 2 ) 2 =29,186.31 
v 2 25 

SS Tot = II Y,f -CF = 29,434.40-29,186.31 = 248.09 

i i 


Treatment Totals and Means of Dry Matter 


T reat. 

A 

B 

C 

D 

E 


Total 

176.8 

171.9 

178.6 

177.3 

149.6 


Mean 

35.36 

34.38 

35.72 

35.46 

29.92 



S SR =ll R2 - CF = 1 (156.2 s + +178.6 2 ) - CF = 84.15 


SSC = -]Tcf -CF = -(172.5 2 +.+165.4 2 )- CF = 11.41 

v . 5 

4 

SST = -YT k 2 -CF = -(176.8 2 + + 149.6 2 )-CF = 117.95 

v r 5 


SSE = SSTOT - SSR - SSC - SST 

= 248.09 - 84.15 - 11.41 - 117.95 = 34.58 

ANOVA Table 


Source 

d.f. 

SS 

MS 

F 

Rows 

4 

84.15 

21.04 

2.43 

Columns 

4 

11.41 

2.85 

0.99 

T reatments 

4 

117.95 

29.49 

1 10.23** 

Error 

12 

34.58 

2.88 


Total 

24 

248.09 




** Significant at the 1% level 

Here Tabulated F 0 .o 5 ( 4 .i 2 ) = 3.26 F 0 . 0 i( 4 .i 2 ) = 5.41 

Since 10.23 > 5.41, treatment differences were highly significant. 
Since 2.43 < 5.41, rows were not effective in reducing error. 
Since 0.99 < 3.26, columns were not effective in reducing error. 
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a. Treatment means: 




b. Difference between two treatment means: 
SE„. 

V v v 5 


CD = SE d x t, 0 01 , , 2 ) = 1.07 x 3.05 = 3.27 

Mean dry matter yield (kg/plot) with Various Sources of Added 
Incolumn. 


Source 

A 

B 

C 

D 

None 

SE 

CD at 1% 

Mean 

Yield 

35.36 

34.38 

35.72 

35.46 

29.92 

0.76 

3.27 


Statistical analysis of data from an experiment designed to 
measure the effect of different sources of seed inoculums on the dry 
matter yield of a crop indicates that differences among treatment 
means were highly significant. 

There appeared to be no differential effect of irrigation as well as 
distance from shade from one end to the other. 


□ □ □ 



Chapter - 5 


Repeated Latin Squares, Graeco Latin 
Square, Cross Over and Youden Square 

Designs 


A latin square design is not normally used when the number of 
treatments is eight or more, because a design of higher order 
requires too many replications and it may not also be possible to get 
the required type of experimental units. Again, if the number of 
treatments is very small, then also there are difficulties for adopting a 
latin square design. When there are two treatments a latin square 
design cannot be adopted, because from a 2 X 2 latin square design 
error variance is not estimable. For a 3 X 3 latin square design the 
degrees of freedom for error sum of squares is 2 and for a 4 x 4 latin 
square it is 6. In both these cases the degrees of freedom for error 
sum of squares are too small. To make the design more effective in 
such cases, the latin square design may be repeated, that is, instead 
of taking one latin square, a number of, say, p latin squares each of 
the same order, is taken for the experiment. The treatments are the 
same in each square but each has a separate set of units and a 
separate randomization. The data obtained from such repeated latin 
squares are analyzed as below. 

Each of the p latin squares is first analyzed separately by 
following the method described in previous chapter. The 
corresponding sums of squares obtained from the different squares 
are then added. This gives the pooled row, column, treatment and 
error sums of squares. The pooled row sum of squares is also called 
between rows within square sum of squares and similarly for the 
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other pooled sums of squares. 

From each of the p squares, the k treatment totals are obtained, 
With these totals a p x k table of squares x treatment is prepared. 

Let Pj denote the total of all observations in i ,h square, (i = 1,2,..., p) 


T s denote the total of observations of s ,h treatment from all the 
latin squares, (s'= 1, 2,..... k) 

and T is denote the total of all observations of s th treatment in i th 
square. The square x treatment table is formed with these Tj S totals. 
The totals Pj and T s are the marginal totals of the Square x 
Treatment Table. 


Next, the following sums of squares are obtained: 
Correction factor = (^T Pj 2 )/pk 2 


i 


Sum of squares due to squares = ^ 


Sum of squares due to treatment = — 


kp 


C.F. 

-C.F. 


Treatment x square interaction 

j2 

= (Pooled treatment s.s.) - ( V — - C.F.) 

r k p 

Total sum of squares = 2>jjs>- c - F - 

where, y ijst denotes the observation from t ,h square in its i th row, i th 
column and under s treatment. The following partitioning of the 
degrees of freedom of analysis of variance then follows: 

The treatment x square SS may be pooled with error SS if there 
is no treatment x square interaction. If the squares are at different 
locations, then under certain hypothesis, it is appropriate to use the 
treatment x square mean square to test the treatment differences. 
Otherwise the testing and other procedures are the same as in the 
ordinary latin square designs. 
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Partitioning of Degrees of Freedom in the Analysis of Variance 
of Repeated Latin Squares 


Sources of variation 

d.f. 

Squares 

p-i ~~ 

Between rows within squares (pooled) 

P(k-1) ^ 

Between columns within squares (pooled) 

Pfr-1)____ 

T reatments 

k-1 

Treatment x squares interaction 


Error (pooled) 


Total 



Graeco Latin Square Designs 

We have seen that by using latin square designs treatment 
effects can be estimated by eliminating two sources of variation. This 
technique can be extended further to eliminate more sources of 
variation. A graeco latin square is one such design where three 
sources of variation can be estimated. This design can be defined 
formally as below. 

An orthogonal design by which three sources of variation can be 
eliminated by using only k 2 units where k is the number of treatments 
is called a k x k graeco latin square design. 

A design is said to be orthogonal when the data obtained from it 
are orthogonal. When all the levels of a controlled factor occur with 
each level of any other controlled factor in a design, the data 
obtained from such a design are always orthogonal. 

In order to make the data from a graeco latin square orthogonal, 
we have to take another controlled factor R at k levels in addition to 
the two controlled factors P and Q introduced for the latin square 
design, such that each level of R occurs with each level of P and also 
of Q only once. If treatments ip an ordinary latin square design are 
taken to be levels of the factor R, then such an arrangement is 
provided by the latin square design. In that case we have to allocate 
treatments to the units such that each treatment occurs once with 
each level of the controlled factors P, Q and R in the k 2 units. For 
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example, let us replace the five treatments symbols A, B, C, D and E 
in the example of the latin square design written systematically by the 
five greek letters «, (T y, 8 and 0 respectively. These letters represent 
the five levels of the factor R. We then get the following arrangement 
of five levels of each of three factors in 5 2 units: 


Level of Q 

Levels of 

P 

a 

P 

Y 

8 

0 

3 

Y 

6 

0 

a 

_Y__ 

5 

0 

a 

3 

5 

0 

a 

3 

Y 

0 

a 

p 

_Y 

8 


Now, five treatments denoted by A, B, C, D and E are to be 
allotted to the above 25 units such that each treatment occurs once 
in each row, each column and with each of the greek letters. 


It is not easy to allot the treatments to the units so as to satisfy 
the above requirements of a graeco latin square design. The problem 
can, however, be solved easily by exploiting the properties of 
orthogonal latin squares described below. 


Orthogonal Latin Squares 


Two latin squares each of the same order, say, r are said to be 
orthogonal if when one is superimposed on the other each symbol of 
one falls on each symbol of the other once and only once. 


For example, the follow ing two latin squares are orthogonal. 


Square 1 

A 

B 

C 

B 

C 

A 

C 

A 

B 


Square II 

a 

— 

p 

Y 

_Y_ 

a 

3 

p 

_Y 

a 


Orthogonal latin squares are also available in the Tables for 
Statisticians and Biometricians by Fisher and Yates (1948). 


For construction of a graeco latin square design we have to take 
two orthogonal latin squares each of order k. One of these squares is 
written by using latin letters and the other by* using greek letters. By 
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superimposing one square over another and by treating the greek 
letters as the levels of the third controlled factor R, we get the graeco 
latin square design. The rows and columns of the square denote 
levels of the other two controlled factors. 


For example, the following is a graeco latin square with five 
treatments. 


aA 


yD 

5C 

0B 

3B 

yA 

5E 

0D 

aC 

_ yc 

8B 

0A 

aE 1 

3D 

8D 

ec 

aB 

3A 

yE 

0E 

aD 

mrmm 

_yB 

5A 


The randomization procedure of this design is the same as that 
of the latin square design. The analysis is alsc the same excepting 
that an extra component of the sum of squares due to the factor R is 
to be obtained in addition to the usual sums of squares for the latin 
square design. 


ANALYSIS OF VARIANCE TABLE 


Source 

d.f 

ss 

MSS 

F 

Rows 

k-1 

y —-c.f. 

^ k 

MSR 


Columns 

k-1 

r 2 

y —-c.f. 

^ k 

MSC 


Symbols 

k-1 

q2 

y^-c.f. 

^ k 

MSS 

• 

T reatments 


^ k 

MST 

MST 

lyiSE ~ ' error d 0 

Error 

HBM 

nsn 

(By subtraction) 

MSE 


Total 


Z2>«ke 2 - C.F. 



CO 

m 

3 

II 

/mse 

V k 
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SEh = 


I2MSE 


CD — SEd X t( tti error d.f) 

These designs are supposed to be very efficient but it is very 
difficult to get situations and the required types of experimental units 
for adopting such designs. This procedure can be extended further to 
get hyper-graeco latin square designs which can be used to eliminate 
four or more sources of variation. These designs can be constructed 
by superimposing three or more mutually orthogonal latin squares. 
Utility of such designs is, however, limited. 


Cross over Designs or Switch over Designs 

Another class of designs to remove the heterogeneity of 
experimental material in two directions are the cross over designs 
and like latin square designs they are widely used in animal 
husbandry experiments, where the performance of units is not 
uniform throughout the period. For example, while studying the effect 
of feeds in milk yield experiments, the performance in the morning 
day may be better than that in the evening or performance in the first 
half of lactation period in general is better than that in the second half 
of lactation period. 

Here total experimental periods is divided into t-factorial periods 
i.e. equal to number of treatments or feeds to be tested and the 
treatments are applied to different units in different fractional periods 
in such a way that 

1. Each unit receives all the t-treatments in one or the other period 
i.e. each unit is a complete replicate. 

2. In each fractional period all treatment occur equal number of 
times which holds when number of unit r is multiple of number of 
treatments i.e. r = mr. 

More generally if v treatments are to be tested over v periods, 
one can use p repetitions of a latin square design (or different Latin 
squares) juxtaposed if pv experimental subjects are available. If 16 
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animals are available for experimenting 4 treatments A, B, C, D one 
can use the following designs. 


Periods 

■ 



B 


Animals 


B 

2 

3 

B 

5 

6 

7 

8 

9 








1 

D 

B 

C 


B 

C 


B 

B 

D 

B 

C 

D 

B 

C 


2 

B 

H 

D 

C 

B 


C 

B 

B 

C 

D 



D 

B 


3 

C 


B 

B 

C 

B 

B 

D 

C 

B 

D 

D 

c 

D 


B 

4 

D 

c 

B 

B 


B 

B 

C 



C 

B 

D 



Dl 


Here each treatment occurs four times in each of the fractional 
period. Since each unit crosses or switches over or changes to 
different treatments in different fractional periods, so the design is 
called switch over or cross over or change over design. 

Analysis 

Let y ijk is the observation in i ,h unit in j-th period having k-th 


treatment. i= 1, 2, ., r, j, k = 1, 2.t. Arrange the data in the 

following table. 


Periods 

Units 

Total 

1 2 . 

.i. 

__ r 


Pi 




Pi 

P2 




P 2 

■ 


y*i(k) 


• 

Pi 




Pi 

Pt 




p, 

Totals 

Hi R2 . 

.Ri. 

.Rr 



Here ERj — EP, — ET k — = G 

C.F. = G 2 /rt 
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Source 

d.f 

SS 

MSS 

F 

Replications 

r-1 


R 


Periods 

t-1 

St _ cf - 

P 


T reatments 

t-1 


T 

T F 

^ r (t-1 .error df) 

Error 

(r-1) 

(t-1) 

(By subtraction) 

E 


Total 

rt- 1 

y, y.j ( k) _ c.f. 




Rest of the analysis is the same as of the basic designs 


[2E 

~ •»/ X Ha, error d.f) 


Remarks 

Switch over designs are preferred over set of LSD, when units 
with in each factorial period are homogenous. 

Set of L.S.D. can’t be used for t=2 as the error df p(t-1)(t-2) = 0 
whereas switch over designs are most appropriate under this 
situation. 

Example 

Given below is data on milk yield in litres of 4 cows when given 
two feeds in two fractional periods. Give complete analysis using 
crossover design. 
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Table: Cross-over experiment for comparing two feeding 
treatments 


Period 

Animal Number 

in 

1 

2 

3 

4 

Early lactation 

A(10.5) 

B(9.8) 

B(6.4) 

A(9.5) 

36.2 

Late lactation 

B(8.5) 

A(9.2) 

A(6.6) 


-—-_ 

30.8 

Total 

19.0 

19.0 

13.0 

16.0 

67.0 


The calculations are as follows: 


Correction factor = — 7 ' 0 ^ =561 125 

8 


Total SS. = ( 1 0.5) 2 + (9.8) 2 + (6.4) 2 + (9.5) 2 + (8.5) 2 

+ (9.2) 2 + (6.6) 2 + (6.5) 2 -561.125 = 19.075 

Animol CQ 19.0 2 +19.0 2 + 13.0 2 +16.0 2 

Animal SS. ----561.125 = 12.375 


Periods S.S. 


36.2 2 +30.8 2 
4 


561.125 = 3.645 


Treatments S.S. = ( 1Q - 5 + 9 - 2 + 6 - 6 + 9 - 5 ) 2 + (8-5 + 9.8 + 6.4 + 6.5) 2 

4 

-561.125 = 2.645 

Error S.S. = 19.075 - 12.375 - 3.645 - 2.645 = 0.410 
We thus set the ANOVA table 


Anova Table 


Source 

d.f. 

S.S. 

M.S. 

F 

Animals 

3 

12.375 

4.125 

20.12 

Periods 

1 

3.645 


17.78 

T reatments 

1 

2.645 

2.645 

12.90 

Error 

2 

0.410 

0.205 


Total 

7 

19.075 




The calculated F value for testing the significance of the 

treatment effects is 12.90 while F„. 2005) = 18 .51. Thus we conclude 

the non-significance of the treatment effects at 0.05 level of 
significance. 
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Youden Square Design 

A Youden Square Design (YSD) is a design with incomplete 
columns by means of which two sources of variation can be 
eliminated. The rows of a YSD form a RBD and the columns form a 
Balanced Incomplete Block (BIB) design. These are basically 
symmetrical BIB designs by which the block to block variation can be 
eliminated. The k units in each block can be thought of occupying k 
different positions. With the help of YSD the effects of such positions 
can also be eliminated. 

Definition 

A YSD is an arrangement of v treatments in a k x v rectangular 
array such the every symbol occurs exactly once in each row and the 
columns form a symmetrical BIB design with parameters v=b, r=k, X. 

The k units in each block of a symmetrical BIB design can always 
be so arranged that each treatment occupies each position once in 
some block or the other. When a symmetrical BIB design is obtained 
by developing an initial block such an arrangement is evident. 

Following is a YSD with 7 treatments arranged in 3 rows and 7 
columns. 

1 2 3 4 5 6 7 

2 3 4 5 6 7 1 

4 5 6 7 1 2 3 

Example of 7 treatments arranged in 4 rows and 7 columns 

1 2 3 4 5 6 7 

3 4 5 6 7 1 2 

6 7 1 2 3 4 5 

7 1 2 3 4 5 6 

The position effects are evidently orthogonal to the treatments as 
each treatment occurs once in each position. The position effects are 
similarly orthogonal to the blocks as well. Hence, the analysis of YSD 
is the same as that of BIB designs excepting that a components of 
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the sum of squares obtained from the position totals of the 
observation is subtracted from the error sum of squares of the BIB 
design. 

The Youden squares can also be looked upon as an incomplete 
v x v latin square with k suitably chosen columns. If a column or a 
row is omitted from a latin square, the resultant design is always a 
YSD. All these designs Latin Square Design, cross over designs and 
Youden square Design falls in the general category of Row and 
Column designs or designs for two-way elimination of heterogeneity 
where there can be incomplete rows and columns, complete rows 
and incomplete columns, incomplete rows and complete columns, 
complete rows and columns. 

Youden squares are useful in experiments requiring to remove 
the heterogeneity in two directions and where fewer replications of 
the treatments are available. 

Yjj(m) = M- + Pi + Yj+ T m + e ij( m), 

where p is the general mean, pj is the i th row effect V, is the j 1h 
column effect, x m is the m lh treatment effect e jj(m) are random errors 
assumed to be independently normally distributed with mean zero 
and variance a 2 . Let R 1f R 2 ,...,Rk be the k rows totals; C 1f C 2 „.., C v 
be the v column totals, T 1t T 2 ,..., T v be the v treatment totals and G 
be the grand total. Further let Q m be the adjusted m th treatment total 
obtained by subtracting the sum of column means in which m th 
treatment occurs from Tm (m=1,2,..., v). It then follows that 

and the ANOVA table will be as given below: 

The significance of the treatment effects can be tested by 
comparing the F value with F a;v _ 1iVk _ 2v . k+2 . The calculation of treatment 
S.S. is identical to the one for a BIB design and the other calculations 
are routine exercises. 

C D. for comparing any two treatment effects is 
(t«.vk-2vk«-'.?)( yj 2kMS e />.v ) 
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ANOVA Table for an Youden Square Design 


Source 

d.f. 

S.S. 

M.S. 

F 

Rows 

k-1 

y Rif. g 2 

w v vr 

MSR 


Columns 

v -1 

f c i 2 G 2 

H k vr 

MSC 


T reatments 
(eliminating 
rows and 

columns 

v -1 

E 

OJ 

o 

*13 

> WI 

II 

E 

o 

E 

MST 

MST/MSE 

~ F(v-1, error 

df) 

Error 

vk'2v- 

k+2 

By subtraction 

MSE 


Total 

vk-1 

V n 2 

'.I 


f'- 

:p 


* Error S.S. = Total S.S. - Rows S.S. - Column S.S. - Treatment 

(eliminating rows and columns) S.S. 

□ □ □ 






Chapter - 6 


Sampling in Field Experiments 


In agricultural field experiments, the size of the plot is selected in 
order to achieve a prescribed degree of precision for measurement 
of the character of primary interest. We then measure the character 
under study on the whole of the experimental unit i.e. plot. Because 
of the nature of character of primary interest like yield, the plot size 
required is often larger than that needed to measure other 
characters. In order to save expenses and time the measurements 
can be made by sampling a fraction of the whole plot. For example, 
for plant height, the measurement can be made only from say 10 of 
the 200 plants in the plot, for tiller number, count only 1m 2 of the 
15m 2 plot, for leaf area, measure only 20 of the approximately 2000 
leaves in the plot. For such cases like plant height, leaf area etc. it 
may not be always possible or desirable to get the plot wise 
measurement. Here we resort to sampling in each plot and obtain the 
measurements on a certain number of sampling units in each plot 
and subject the data for statistical analysis. 

Standard analysis of variance requires that there is single 
observation per character per experimental unit, which is not directly 
applicable to multi observation data. It can only be applied to the 
average of all samples from a plot, or to the average of all 
measurement made overtime for each plot. We here discuss 
app opriate procedures for directly analyzing such data. 

/ 
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Analysis 

Suppose an experiment is conducted with t-treatments replicated 
r times and let there be s observation made in each plot. We assume 
the following linear additive model for the block design 

Yjjk = p + tj + Bj + ey + n ijk 

where Y jjk is the observation on the k th sampling unit for the i th 

treatment in the j th replicate (i= 1, 2.t; j = 1, 3,.r; k = 1, 

2 .s), p is the general mean effect, tj is the effect of i th 

treatment, Bj is the effect of j ,h replication, e^ is the plot error 
distributed as N (0, a e 2 ), n ijk is the sampling error distributed as N(0, 

o. 


Table-1: Format for the analysis of variance of Data from Plot 
Sampling in a CRD with t Treatments, r Replication, and s 
Sampling Units per plot. 


Source of 

variation 

Degree 

of 

Freedom 

Sum of 
Squares 

Mean 

squares 

Computed 

F 

T reatment 

t-1 




Experimental error 

t(r-1) 




Sampling error 

rt(s-i) 




Total 

rts -1 





Here an additional source of variation can be measured i.e., due 
to sampling variation, which is commonly known as sampling error. 
The format for analysis of variance for data from plot sampling of 
Completely Randomized Design (CRD) and a randomized block 
design (RBD) with t treatment and r replications; and for a split plot 
design with a main plot treatment and b subplot treatment and r 
replication (However, the split plot design shall be discussed later) is 
given in table 1,2 and 3. 

The computation procedure is illustrated for RBD and split plot 
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design with the help of an example.' Here the only distinct feature of 
the analysis variance for data from plot sampling is the part involving 
sampling error. 


Tabled: Format for the Analysis of Variance of Data from Plot 
Sampling in a BSD Design with t Treatments, r Replications and 
s Sampling Units per Plot 


Source of variation 

Degree 

of 

Freedom 

Sum of 
Squares 

Mean 

squares 

Computed 

F 

Replication 

r-1 




T reatment 

t-1 




Experimental error 

gBRSn 




Sampling error 

I93H 




Total 

rts -1 





Table-3: Format for the Analysis of Variance of Data from Plot 
Sampling in a Split-Plot Design with a Main Treatments, b 
Subplot Treatments, r Replications and s Sampling Units per 
Plot 


Source of variation 

Degree of 

Freedom 

Sum of 

Squares 

Mean 

squares 

Compute 

d F 

Replication 

r-1 




Main plot treatment (A) 

a-1 




Error (a) 

(r-1)(a-1) 




Subplot treatment (B) 

b-1 




A x B 

.(a-O(b-l) 




Error (b) 

a (r-1) (b-1) 




Sampling error 

abr(s-l) 




Total 

rabs - 1 





Randomized Block Design 


An experiment with nine treatments in four replications was 
“ ln ( Randomi2ed B 'ock Design. Data on tiller count, 
«,,.™ 2 „ 2 „ „ mpli „ g unllj b 
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Tiller Count (no.Mhills) of Rice tested under nine fertilizer 
treatments in a RBD experiment with four replications end four 
sampling units (S^ S 2 , S 3 and S 4 )_ 


Treatments 

Rep. 1 

IIP 

IIZuTi 

.11 



Rep.Ill 

Rep.IV 


im 

m 

S 2 

S 3 

S 4 

Si 

S2 

S3 


Si 

S2 

S 3 

S 4 

m 

S 2 

S3 

S 4 

i 

31 

24 

26 

20 

25 

23 

26 

31 

33 

25 

34 

22 

38 

40 

39 

27 

2 

46 

44 

35 

45 

60 

58 

40 

44 

68 

62 

60 

60 

38 

55 

38 

62 

3 

41 

46 

60 

48 

47 

38 

41 

68 

50 

46 

53 

52 

49 

63 

55 

71 

4 

42 

48 

41 

52 

67 

63 

74 

51 

78 

58 

72 

71 

55 

39 

50 

60 

5 

48 

66 

57 

50 

48 

74 

53 

48 

54 

40 

47 

60 

52 

50 

38 

74 

6 

65 

60 

55 

41 

53 

49 

57 

52 

72 

55 

44 

53 

61 

55 

44 

50 

7 

55 

42 

67 

51 

44 

48 

74 

55 

78 

50 

77 

51 

70 

78 

68 

78 

8 

60 

58 

55 

47 

42 

55 

66 

62 

44 

55 

62 

72 

66 

75 

82 

64 

9 

77 

65 

77 

45 

52 

45 

43 

50 

66 

57 

55 

59 

51 

53 

48 

47 


Computations 


Construct the replication x treatment table of totals and compute 
replication total (R), the treatment total (T) and the grand total G. 

The Replication x Treatment Table of Totals Computed from 
Data. 


Treatment 

Number 

Tiller Count Total (RT) 

Treatment 
Total (T) 

warn 


wmm 


1 

101 

105 

114 

144 

464 

2 

170 

202 

250 

193 

815 

3 

195 

194 

201 

238 

828 

4 

183 

255 

279 

204 

921 

5 

221 

223 

201 

214 

859 

6 

221 

211 

224 

210 

866 

7 

215 

221 

256 

294 

986 

8 

220 

225 

233 

287 

965 

9 

264 

190 

237 

199 

890 


1790 

1826 

1995 

1983 


Grand total (G) 

. 




7594 
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C.F. = G 2 /trs = (759 ^ - = 400478.00 

9 x 4 x4 

Total ss= Xx 2 -C.F. 

= [(31 ) 2 + (24) 2 +.(47) 2 ] - 400478 = 26535 

VR 2 

Replication SS = —-C.F. 

ts 

(1790) 2 + (1.826) 2 + (1995) 2 + (1,983) 2 „„„„„ 

= --'- 5 - 7~~, - 1 -—-— - 400478 = 930.027 

9 4 


Treatment SS = 


it 2 

rs 


C.F. 


(464) 2 + (815) 2 +... + (890) 2 
(4) (4) 


-400478 = 11815.97 


Experimental 


errors SS = 


£(RT) 2 

s 


C.F. - Replication SS - 


Treatment SS 

(101) 2 + (105) 2 + (114) 2 +... + (199) 2 

4 .". . 


- 400478 - 930.027 - 11815.97 = 4720.972 
Sampling error SS = Total SS - (sum of all other SS) 
= 26535.97 - (930.027 + 11815.9 + 4720.972) = 9069 
The format of analysis of variance is given in table 2. 


Analysis or variance (RCB with Data from Plot Sampling) of Data 


Source of 

variation 

) Degree of 
Freedom 

Sum of 
Squares 

Mean 

squares 

Computed 

F 

Replication 

3 

930.03 

310.01 


T reatment 

18 

11815.97 

1477.00 

7.5 r* 

Experimental error 

24 

4720.97 

196.71 

r 2.34** 

Sampling error 

108 

9069.00 

83.97 


Total 

143 

26535.97 




** Significant at 1% level. 
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The mean square (s^) is first tested against s 2 2 if (i) Si 2 is 
significant than treatments are tested against s^ and if (ii) s/ is not 
significant then treatments are tested against the pooled mean 
square of s-\ 2 and s 2 2 . Here s/ is significant, so we test the treatment 
against s^. F(8, 24) at 1% level of significance is 3.36 which is less 
than the computed value of F, so treatments differ significantly. 



where s^ is the experimental error MS in the analysis of 
variance. For our example, the standard error of the difference 
between any pair of treatments is: 


1 2(196.71) 
V (4) (4) 


4.96 


CD at 1% = t ( .oi, 24 ) x S.E d = 2.79 x 4.96 = 13.83 

Here the estimates of the sampling error variance and of the 
experimental error variances are: 



where s, 2 is the sampling error MS in the analysis of variance 

For our example, the two variance estimates and their corresponding 
cv values are y 

sf = 83.97 

2 196.71-83.97 

s e --:-- 28.18 


CV (S) = xl 00 = 17.38% 

o*:. 73 

CV ( E ) = X100 = 10.067% 


52.73 
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Split - Plot Design 


An experiment was conducted in a split-plot design involving 
eight management levels as main-plot treatments and four of 
nitrogen application as subplot treatments. There are three 
replications. The data on plant height measured on two single-hill 
sampling units per plot, are given below. 


Height of rice plants measured on two sampling units (Si and S 2 ) 
per plot, from a Split-Plot Experiment involving eight management 
levels (Mi, M 2 ,.M 8 ) and four times of Nitrogen application (T 1f T 2 , 


T 3 , and T 4 ) with three replications. 
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BB 

99.5 

98.6 

101.5 

101.0 

111.3 

109.8 


m 7 

90.3 

91.3 

96.1 

94.2 

100.0 

98.6 


Ms 

108.9 

110.0 

111.3 

113.4 

100.0 

98.0 

T4 

Mi 

108.3 

110.0 

106.8 

105.7 

118.0 

115.6 


m 2 

89.6 

ko.s 

116.0 

121.5 

93.4 

101.5 


Ms 

93.5 

99.6 

115.6 

117.8 

119.2 

121.5 


m 4 

104.9 

107.8 

105.6 

116.9 

121.5 

124.6 


Ms 

100.0 

101.3 

116.5 

119.6 

126.8 

127.6 


Me 


96.0 

101.5 

102.8 

120.3 

121.8 


m 7 

95.8 

97.8 

100.8 

100.9 

111.5 

116.5 


Me 

112.8 

113.8 

95.8 

94.3 

112.5 

114.9 


Let the main plot factor is A, subplot factor is B. Denote the levels 
of factor A by a, the levels of factor B by b, the number of replication 
by r and the number of sampling units per plot by s. 


Step (i) Construct Replication x factor A two way table of totals. 


The Replication x Management Level Table of Totals (RA) 
Computed from Data. 


Management 

Level 

Plant Height Total (RA) 

Management 
Total (A) 

Rep.l 

Rep.ll 

Rep.lll 

M 1 

839.3 

874.2 

908.9 

2622.4 

m 2 

740.6 

920.3 

827.3 

2488.2 

m 3 

748.2 

889.8 

917.0 

2555.0 

m 4 

783.8 

884.6 

909.0 

2577.4 

M s 

808.4 

859.1 

878.1 

2545.6 

m 6 

788.9 

838.9 

918.0 

2545.8 

m 7 

764.7 

788.9 

855.5 

2409.1 

m 8 

870.9 

855.6 

841.9 

2568.4 


6344.8 

6911.4 

7 055. 7 


Grand total (G) 




20311.9 


C.F. 


Gl = (20311.9)» 
rabs (3)(8)(4)(2) 


Step (ii) Construct factor A x factor B two way table of totals 
(AB) 
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The Management Level x Time of Nitrogen Application Table of 


Management Level 



Plant Height Total (AB) 

Ti 

t 2 

t 3 

647.9 

665.1 

645.0 

617.5 

636.9 

621.3 

626.1 

636.6 

625.1 

616.8 

656.0 

623.3 

616.1 

644.6 

593.1 

628.0 

658.7 

621.7 

608.6 

606.7 

570.5 

634.7 

648.0 

641.6 

4995.7 

5152.6 

4941.6 


Total SS 


LX -C.F. 

r -\2 . / 


T 4 


664.4 


612.5 


667.2 


681.3 


691.8 


637.4 


623.3 


644.1 


5222.0 


= [(102.5r + (108.5r + ... + (114.9) - 2148819 = 4992.63 

LR 2 

Replication SS =-C.F. 


(6344.8) 2 + (6911.4) 2 + (7055.7) : 
(8) (4) (2) 

LA 2 

A (management level) SS = —- 


-2148819 = 4412.692 


C.F. 


(2622.4) 2 + (2488.2) 2 +... + (2568.4)' 


(3) (4) (2) 


-2148819 = 1212.289 


Error (a) SS = 


I(RA)‘ 


- C.F. - Replication SS - A SS 


(839.3) 2 + (874.2) 2 +... + (841.9) 2 
(4) (2) 

- 2148819 - 4412.692 - 1212.289 = 3053.181 

LB 2 

B (time of application) SS =-C F 

ras 


_ (4,995.7) 2 + (5152.6) 2 + (4941,6) 2 + (5222.0) 2 

(3)(8)(2) 


-2148819 
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= 1076.654 

Step (iii) Construct Replication 
table of totals (RAB) 
Management Time of _ 
Level_Application 


x factor A x factor B three way 





Plant Hei 

.1 I 


211.0 


218.9 


191.1 


218.3 


180.1 


193.4 


187.0 


180.1 


188.1 


185.2 


181.8 


193.1 


184.8 


205.0 


181.3 


212.7 


207.1 


194.0 


206.0 


201.3 


203.7 


196.1 


198.1 


191.0 


188.0 


201.5 


181.6 


193.6 


212.8 


212.6 


218.9 


226.6 


|ht Total (RAB] 


220.7 


221.6 


219.4 


212.5 


226.2 


237.2 


219.4 


237.5 


226.3 


211.6 


218.5 


233.4 


214.3 


233.0 


214.8 


222.5 


209.5 


215.4 


198.1 


236.1 


210.9 


221.2 


202.5 


204.3 


198.8 


198.1 


190.3 


201.7 


224.1 


216.7 


224.7 


216.2 


224.6 


234.5 


233.6 


211.2 


206.3 


214.9 


194.9 


211.7 


239.8 


224.8 


240.7 


217.7 


218.0 


227.2 


246.1 


199.5 


235.2 


189.0 


254.4 


213.4 


241.4 


221.1 


242.1 


221.8 


207.1 


198.6 


228.0 


197.8 


218.7 


98. 



AxBSS 


I(AB) : 


-C.F.-BSS-ASS 
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(647.9) 2 + (665.1) 2 + + (644.1) 2 

(3)(2) 

- 2148819-1076.654-1212.289 = 981.5536 

Error (b) SS= -C.F. - Replication SS - A SS 

s 

- Error (a) SS - B SS - A x B SS 

= (211.0) 2 + (220.7) 2 +.+ (227.4) 2 

2 

- 2148819-4212.692-1212.289 

- 3053.181 - 1076.654 - 981.5536 = 3314.981 
Sampling error SS = Total SS - (Sum of all other SS) 

= 14992.63 - (4412.692 + 1212.289 +. 3053.181 
+ 1076.654 + 981.5536 + 3314.981) = 941.285 


Analysis of Variance (Split-Plot Design with Data from Plot 
Sampling) 


Source 

Degree 

of 

Freedom 

Sum of 
Squares 

Mean 

Square 

Computed 

F 

Replication 

2 

4412.692 

2206.346 


Management level 
JA} _ 

7 

1212.289 

173.1841 

< 1 

Error (a) 

14 

3053.181 

218.08 

(E a ) 


Time of application 
JB)_ 

3 

1076.654 

358.8848 

5.20** 

A x B 

21 

981.5536 

46.74065 

<1 

Error (b) 

48 

3314.981 

69.06 

(Eb) 


Sampling error 

96 

941.285 

■-1— yj. 

9.80 (E,) 


Total 

★ * _ ■ • r * . * _ . 

191 

14992.63 

-i- 91 _ 



= significant at 1 % level 


The result indicates that only the effect of the time of nitrogen 
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application is significant. 


For pair comparison, compute the standard error of the mean 
difference following the appropriate formula but with one modification 
- multiply each divisor by s, the sample size in each plot. For 
example, to compare two subplot treatments at the same main-plot 
treatment, the standard error of the mean difference is; 


S.E d = 



'2x 69.06 


3x2 


4.80 


Where Eb is the error (b) MS from the analysis of variance. And, 
to compare two subplot treatments (averaged over all main-plot 
treatments) the standard error of the mean difference is: 


S.E d = 



2x 69.06 
3x8x2 


1.70 


Now the C.D. values can be calculated and significance of 
treatment means can be tested. 

Compute the estimates of two variance components: the 
experimental error associated with the smallest experimental unit [i.e. 
error (b) in this case] and the sampling error as: 

s§ = ,§bJl§s_ _ 69^)6 - 9.80 _ gg 63 


s| = E s = 9.80 

where E s is the sampling error MS, E b is the experimental error 
S, and s is the number of sampling units per plot. 

CV(E)= iHi x100 =5 - 14% 

CV ( S ) = ^5 x100 =2.96% 

The results indicate a relatively small sampling error compared to 
'he experimental error. 


□ □ n 


Chapter - 7 


Missing Plot Technique 

A missing data situation occurs whenever a valid observation is 
not available for any one of the experimental units. Poor germination, 
physical damage during crop culture, damage due to birds and 
rodents and pest damage are common causes of destruction of 
experimental units. When ever there is a missing plot, the experiment 
cannot be analysed in the usual way, except designs like completely 
randomized design, where we can analyse the data even with 
unequal replications. Experiments are usually planned with equal 
number of observation per cell, which facilitates the analysis. The 
designs CRD, RBD, Latin squares are called orthogonal designs, 
here the block contrasts, treatment contrast and error contrast are 
orthogonal to each other and therefore additive. The effects of the 
treatments, blocks etc. can be estimated and tested with out any 
entanglement (i.e. separated). Such a situation in RBD and Latin 
square designs does not remain so if there is a missing plot. The 
data becomes non orthogonal and the same can not be analyzed in 
the usual way of analysis of variance. Forced with such situation, the 
experimenter would still like to obtain all the pieces of information 
possible from the remaining observations. 

In response to such situations, several analytical procedures for 
handling such experiments have been developed, which are. 

(i) Estimating the missing value(s) using principle of Least 
Square i.e. minimizing the error sum of squares. 
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(ii) Method of iteration. 

(iii) Method of fitting constants and, 

(iv) Analysis of data with missing observations by the technique 
of analysis of covariance. 

Here we shall describe the first two methods of analysis of data 
with missing observations: 

Analysis by estimating the missing value 

This method is due to Yates. It consists of estimating the missing 
observations by the value which makes the error sum of squares 
minimum. In order to get such estimates, first unknown values are 
substituted for missing observations. The error sum of squares with 
the designs is then expressed as a function of this unknown. Next, by 
differentiating the error sum of squares with respect to the unknown 
and equating differentiates to zero, as many linear equations in the 
unknown as the number of the missing observations are obtained. 
Solutions of these equations provide the estimated values of the 
missing observations. 

The data are then completed by substituting these estimated 
values and then analysed by the usual techniques appropriate for the 
design. 

Here it is emphasized that an estimate of the missing data 
obtained through this technique does not supply any additional 
information to the incomplete set of data, once the data is lost, no 
amount of statistical manipulation can retrieve it. What the procedure 
attempts to do is to allow the experimenter to compute the analysis of 
variance in the usual manner (i.e. as if the data were complete) 
without restoring to the more complex procedure needed for 
incomplete data sets. 

From this analysis, the correct error sum of squares is obtained 
but not the treatment sum of squares. In such case, the correct 
treatment sum of squares is obtained by subtracting the correct error 
sum of squares from “within block sum of squares" obtained from the 
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incomplete data by analyzing as CRD, ignoring the treatments. The 
degrees of freedom of the error sum of squares and the total sum of 
squares is reduced by the number of missing observations. 


One Missing observation in RBD: 

Let X be the value of the missing observation 
r be the number of replications 
t be the number of treatments 


Bj’ be the total of available observation in the i th block (with 
the missing observations) 

Tj* be the total of available observations of the j ,h treatment 
(for which the observation is missing) 

G’ be the grand total of all the available observations. 

Error S.S. = Total S.S. - Replication S.S. - Treatment S.S. 

k 2 ^ 




? + x 2 - 




2 + (B i +X)‘ 


t 



This expression is differentiated with respect to X and the 
differential is equated to zero. The solution of the resulting equation 
gives the estimate of X, the missing observation, when the error 
variance is minimum. 


The estimated value of x is given by 
- rBj'+m-G' 

(r—1) (t—1) 

The analysis is carried out by substituting the estimated value of 
X with the following changes: 

(i) One d.f. is to be subtracted from the d.f. corresponding to 
error sum of squares and total sum of squares. 
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(ii) Adjusting the treatment sum of squares and total sum of 
squares by subtracting from it a quantity. 

(B| +tTj -G') 2 
t (t—1) (r—1 ) 2 

Standard error for testing the significance of the difference 
between treatment means. 

(a) S.E. of difference of two treatment means not involving 
missing value 



V r 


(b) S.E. of difference of two treatment means one of which 
involves the missing value. 

SE d = 



Example -1 


To find out the best source of nitrogen at 60 kg/ha, an 
experiment was conducted on paddy in RBD with 5 sources of 
nitrogen in 4 blocks at Paddy breeding centre, Coimbatore, Tamil 
Nadu. The yield data for different treatments are given as 


Yield of grain in kg/plot 



TOTAL [95.6 194.3 |127.5' [ 83.1_[94.0_ 494.5 


The observation relating to application of ammonium chloride in 

























the second block is missing. Analyse the data. 


Calculations 

r =4, t = 5, Bi= 78.9, Tj = 94.3, G' = 494.5 

(i) The estimate of missing value, X, is obtained as 

I I I 

rB , + tTj - G 4x78.5 + 5x94.3 - 494.5 


X = 


( r — 1 )(t — 1) 


3x4 


= 24.4 


I A 

It is to be noted that X is not the value that would have been 
obtained if the value of S 2 was not missing in second block. It is the 
value which, when substituted for missing observation, will enable us 

to obtain through the usual method of analysis an unbiased estimate 
of error variance. 

We now insert the estimated missing value and carry out the 
analysis of variance according to the usual procedure for an R.B.D. 
except for subtracting 1 d.f. from total S.S. as well as from the d.f. for 
error S.S. 

(ii) Calculation of Sum of Squares 

G.T. = 494.5 + 24.4 = 518.9 

c. F . .i5ili s 2H2i. 13462 . M 

rt 20 

Total S.S. = Yjf - C.F. = 14121.21 - 13462.86 = 658.35 


< j 


Block S.S. = V — - C.F. = 13694.87 -13462.86 = 232.01 

— t 


B 


j 2 

Treatment S.S. = L-C.F. 

j r 

= 13806.73 -13462.86 = 343.87 

Error S.S. = Total S.S. - Block S.S. - Treatment S.S. 
= 658.35 - 232.01 - 343.87 = 82.47 
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While the error mean square is an unbiased estimate of the error 
variance, the treatment S.S. is an over estimate and has to be 
corrected by subtracting from it a quantity. 

(Bj +tTj -G') 2 (78.9 + 5x94.3-494.5) 2 _ 17Qg 

t(t-1)(r-1) 2 5x4x9 


Therefore, corrected treatment S.S. = 343.87 - 17.36 = 326.51 


Similarly corrected total S.S. = 658.35 - 17.36 = 640.99 

(iii) Analysis of variance 
ANOVA 


Source 

d.f. 

S.S. 

M.F. 


Blocks 

3 

232.01 

77.34 

|| 


4 

343.87 

85.97 

11.46** 

isaiffli 

4 

326.51 

81.63 

10.88** 

Error 

11 

82.47 

7.50 

- 

Total 

18 

640.99 

35.61 



** indicates significance at 1 percent level 

(iv) Standard Errors 

(a) S.E. the difference between two treatment means not involving 


the missing value. 

SE d = = 1,936kg/plot 

V r V 4 


(b) S.E. the difference between two treatment means one of which is 
a missing value. 


SE d = 



2 ( _ 1 _ 

r + [r(r-1)(t-1) 


jl 

Jj 


7.50 


2 
— + 


1 1 


4 4x3x4 'J 


2.13kg/plot 


Two missing values in R.B.D. 

We have discussed earlier one method in that analysis of 
variance for any number of missing observations can be done by 
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expressing the error sum of squares as function of the unknown and 
then by differentiating with respect to the unknowns and equating the 
resulting equations to zero, we get as many equations as the number 
of unknowns. By solving these equations we get the estimates of the 
missing values. By proceeding in the like manner, we can analyse the 
data. 

Besides this there is a method known as iterative method (Yates 
1933). 

Iterative Method 

The iterative method of analysis is used when more than one 
observation is missing. Suppose values of two observations are 
missing in R.B.D. These missing values can be (i) in the same block 
(ii) in different blocks and belonging to different treatments. The 
estimates of the missing values can be obtained by the method of 
iteration or successive approximations suggested by Yates. The 
missing values are denoted by X and Y and X a and Y u , a = 1, 2. ... p, 
denote the values of X and Y in a ,h iteration process which terminates 
in p steps. The method consists of substituting an approximate value, 

Y-i (usually the mean value of the available observations), for one of 
the missing observations and the other value is computed as in case 
of a single missing observations, by the formula: 

^ rBj + tT: — G 
X= (r — 1 )(t — 1) 

The value X! is now substituted for X and the second iterated 
value of Y namely Y 2 is computed using the formula given above. 
Using this value of Y 2 , the value of X namely X 2 , is computed again 
using the same formula. The Process is repeated till each of the 
values of X and Y obtained from two consecuting iterations are nearly 
the same. The analysis is then done in the usual way except that 
from the total SS and the error SS, 2 d.f. are subtracted. 

The error mean square so obtained is an unbiased estimate of 
the variance. The adjusted SS for treatments are calculated as 
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Ignoring the treatments and without substituting for the missing 
observation, we obtain the between and within blocks SS. Error Sum 
of Squares is then subtracted from this within blocks SS to provide 
the adjusted treatments SS. To obtain a standard error for the 
comparison of two treatment means for A and B both having missing 
values, we assign an effective number of replicates to each treatment 
say r 1 and r 2 . Which are calculated by the following rules: 

(i) Any replicate of treatment A is counted as 1 when both the 
treatment A and B are present in the replicate. 

(ii) Counted as (t-2)/(t-1) when A is present and B is not present. 

(iii) Counted as zero when A itself is missing 

Then the standard Error of difference between treatment means for A 
and B is 

SE (d) = y/MSEXl/t^ + 1/r 2 ). 

and CD = SE (d) x ta at error d.f. 

Example 2 


In the example discussed in example 1, assume that the yield of 
S 4 in third block is also missing. Analyse the data. 

(i) We prepare the following two-way table of treatment and 
blocks putting X and Y for the missing values: 


Yield in kg/plot 


Replications 

T reatments 

Total 


1 

2 

3 

4 

5 


1 

25.4 

32.5 

37.5 

22.5 

20.5 

138.4 

II 

17.3 

Y 

25.4 

14.7 

21.5 

78.9 

III 

22.4 

28.4 

30.1 

X 

23.5 

104.4 

IV 

30.5 

33.4 

34.5 

22.4 

EH1 

149.3 

TOTAL 

95.6 

94.3 

127.5 

59.6 

94.0 

471.0 


(ii) Let Y, = 31.4 (the mean value over replication) for the value 
of Y, thus 


X 


i 


4x104.4 + 5x59.6-502.4 


= 17.77 


3x4 


































A . v/ 4x78.9 + 5x94.3-488.77 

Again Y 2 =-= 24.86 

3x4 


X* = 


Y, = 


Xa = 


4 x 104.4 + 5 x 59.6 - 495.86 

= 18.31 

3x4 

4x78.9 + 5x94.3-489.31 

= 24.82 

3x4 

4 x 104.4 + 5 x 59.60 - 495.82 

= 18.31 

3x4 


v 4x78.9 + 5x94.3-489.31 QO 
14 = 1 --— - = 24.o2 


x 4 = 


3x4 

4x104.4 + 5x59.6-495.82 
3x4 


= 18.31 


Since the pairs (X 3 , Y 3 ) and (X 4 , Y 4 ) are same i.e. (18.31, 24.82), 
we now stop further. These are the estimated values of X and Y. we 
insert these values in the table and analyse the completed data. 
Here, since there are two missing observations, we have to subtract 
two d.f. each from the d.f. for the total S.S. and error S.S. 


Calculation of Sum of Squares: 

_ (QiL2l = ( 514 -1L = 13214.94 
rt 


C.F. 

Total S.S. = 


4x5 

??y,-c.F. 


' i 


= 13923.53-13214.94 = 708.59 

Replication $S = j^B 2 - C.F. 

= 13450.81 - 13214.94, = 235.87 

Treatment = -£T -C.F. 

r i 1 

= 13621.21 -13214.94 = 406.27 

Error S.S. = Total SS - Treatment S.S. - Replication S.S. 
= 708.59 - 406.27 - 235.87 = 66.45 
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Calculation of adjusted Treatment S.S. 

We now have to analyse data of the original table where there 
are two missing values (i.e. analysis for the available information) 


r p - 19 T -) 2 _ ( 471 . 0) 2 
18 " 18 


12324.50 


Total S.S. = Sly' -C.F. 


I I 


= 12973 . 60 - 12324.50 = 649.10 

02 

Replication S.S. = 

Where tj is the number of treatments in the i ,h block 

= (138-4) 2 (78^ (104.4) 2 + (1 48 . 3)1 -1 232 4.50 

5 4 4 5 

= 245.65 

Within Replication S.S = Total S.S.- Replication S.S. 

= 649.10 - 245.65 = 403.45 

The within replications S.S. contains the correct treatments S.S. 
+ correct error S.S. Therefore by subtracting form it the error S.S. 
which we have earlier calculated we get the correct treatment S.S. 


Therefore, correct treatment S.S. = 403.45-66.45 = 337.00 

ANOVA _ 


Source 

d.f. 

S.S. 

M.S. 

F 

Replication 

3 

245.65 

81.88 


T reatments 
(Adjusted) 

4 

337.00 

84.25 

12 .68** 

_i- 1 --- L - 

Error 

10 

66.45 

6.645 


Total 

17 

649.10 

i 



Standard Errors 

(a) S.E. of difference of two treatment means one of which 
involves a missing value. 
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SE d 


2, t(t — 1)(r — 1) l o2 

r r[(r-1) 2 (t-1) 2 —1]J 


= 2.00 kg/plot 

(b) S.E. of difference between the means of the treatments 
containing estimates of the missing value is 

SE d = 


= 2.198 kg/plot 





(c) S.E. of difference between the means none of which involves 
a missing value 

2x6 4 64 - 5 - = 1.823 kg/plot 



One missing observation in Latin Square design: 

The procedure is to first obtain the estimates of missing 
values X, by the formula 

t (R'+C'+T)-2G' 

(t — 1 )(t — 2 ) 

where 

t is the number of treatments 

R’ is the total of available observation in the row with missing 
value. 

C’ is the total of available observations in the column with 
missing value. 

T is the total of available observation for the treatment with 
missing value. 


105 


Missing^ Plot Technique __ 

G’ is the grand total of all the available observations. 

Then the estimated missing value is inserted in the table and the 
analysis of variance is carried out as per the usual procedure of latin 
square design but there again we have to subtract 1 d.f. each from 
the d.f. for total S.S. and the error S.S. 

Example 

In order to find the effect of mineral treatments on Paddy, an 
experiment in latin square design was laid down at Agri. Res. Station, 

Manoda (Mysore) during 1990. The details of the experiment are 
given below: 



T reatments 

Dose 

1 . 

Zinc Sulphate 

5.60 kq/ha 

2 . 

Zinc Sulphate 

11.21 kq/ha 

3. 

Borax Sulphate 

22.42 kq/ha 

4. 

Copper Sulpahte 

-w____ 

5.60 kq/ha 

5. 

Copper Sulphate 

M_ I _ 

11.21 kq/ha 

6 . 

Control 

■-w___ 


Plot Size: 1/100 


Yield of grain in kg/plot 


Row 

1 

2 

3 

4 

5 

6 

Total 

1 

(6) 

4.08 

(5) 

3.63 

(4) 

3.86 

(3) 

2.72 

(2) 

2.72 

(1) 

5.44 


2 

(5) 

5.44 

(4) 

3.63 

(1) 

2.72 

(6) 

(3) 

2.49 

(2) 

6.75 


3 

(4) 

5.61 

(1) 

1.90 

(5) 

2.81 

(2) 

3.01 

(6) 

2.61 

(3) 

4.25 


4 

(3) 

5.27 

(6) 

4.25 

(2) 

3.12 

(5) 

4.82 

(1) 

4.82 

(4) 

3.06 


5 

(2) 

2.07 

(3) 

2.95 

(6) 

2.49 

(i) 

2.75 

(4) 

3.35 

(5) 

2.38 


6 

(1) 

4.48 

(2) 

2.52 

(3) 

2.55 

(4) 

2.69 

(5) 

1.62 

(6) 

3.18 


Total 




15.99(C) 



























Note: Figures in bracket indicate the treatment number. Analyse the 
data and draw conclusion. 


Calculation: 

(i) We first estimate the missing value, X, as 
t (R'+C'+T)-2G' 


X = 


(t — 1 )(t — 2) 


_ 6 (21.03 +15.99 +16.61) - 2 x 122.04 = gg 

5x4 

A 

Now, substitute this value of X in the table and proceed for 
analysis 

(ii) Calculation of various sum of squares 

CF = (0^(125^4405 
t 2 36 


Total S.S. = 491.02 - 440.51 = 50.51 
S.S. due to rows = 453.47 - 440.51 = 12.96 
S.S. due to columns = 454.01 - 440.51 = 13.50 
Treatment S.S. = 441.22 — 440.51 = 0.71 

Error S.S. = Total S.S.-S.S. due to Rows-S.S. due to Columns- 
Treatment S.S. 

= 50.51 - 12.96 - 13.50 - 0.71 = 23.34 


(iii) Adjusted treatment S.S. 

While the error M.S. is an unbiased estimate of the error 
variance, the treatment S.S. is an over estimate and has to be 
corrected by subtracting from it the quantity 

[(t-l)T'+ R' + C' - G 1 ] 2 _ [5 x 16.61 + 21.03 +15.99 -122.04] 2 
(t-1) 2 (t-2) 2 " 5 2 x 4 2 

= 0.01 


Therefore, adjusted treatment S.S. = 0.714 - 0.01 = 0.70 


Missing Plot Technique 


107 


(iv) Analysis of variance 


anova 


Source 

d.f. 

S.S. 

M.S. 

F 

_ 

Rows 

5 

12.96 

2.592 

2.111 

Columns 

5 

13.50 

2.700 

2.198 

T reatment 
(unadjusted) 

5 

* 

0.71 

0.142 

<1 

Treatment 

(adjusted) 

5 

0.70 

0.140 

<1 

Error 

19 

23.34 

1.228 

- 

Total 

34 

50.50 




(v) Standard Errors 

(a) S.E. of difference between the means not involving missing 
value 



= 0.640 kg/plot 

(b) S.E. of difference between two treatments means one of 
which involves missing value 


sEh= Is 


2 
— + 


1 


1 


e |_ t (t -1) (t - 2)'J 

= 0.686 kg/plot 


J 1.228 

f 2 

1 ) 

— + 

V 

16 

5x4 J 


Two missing observation in L.S. Design: 

In this case also the method of iteration as illustrated in case of 
R.B.D. can be used except that Xi is computed from 

^ = t (R , -hC l +T , )-2G l 

1 (t-1)(t-2) 
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Standard errors 
Case-1 

When the observations are missing in different rows, columns 
and treatments. 

(a) S.E. of difference of mean of two affected treatments 
SE d = 

(b) S.E. of difference of means of an affected treatment and an 
unaffected one 

2 t(t-3) + 2 (k-1) 1 2 

t t[(t-3)[t(t-3) + 2k]lj e 

Where k, denotes the number of missing observations 

Case-1! 

Where a row, column or treatment is partly, but not completely 
missing, 

(c) S.E. of difference between the means of any two affected 
treatments when a part of a row or a column is missing but not the 
part of a treatment. 

SE d = 

(d) S.E. of difference between the means of an unaffected 
treatment and that of an affected one. 

SE d = 






(e) S.E. of difference between the means of a treatment which is 
partly missing and that of an unaffected treatment mean 



Standard error of difference for the treatments can also be find 
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out by 


SEd = 




+— fe 


■j 
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Where r 1 and r 2 are the effective number of replications for the 
two treatments which are having missing value. 

A treatment is given a score 1 if in a row the other treatment is 
present in row and column, 2/3 if the other treatment is not present in 
either row or column, 1/3 if the other treatment is not present in both 
rows and columns and zero if the treatment itself is missing. Let the 
design be ___ 


(A) 

(B) 

C 

D 

B 

c 

D 

A 

C 

D 

A 

(B)_ 

D 

A 

(B)__ 

c 


For comparing treatments A and B the effective no. of 
replications for A is: 0+2/3 + 1/3 +1/3 = 4/3 


The effective number of replications for B = 0 + 2/3 + 0 + 0 = 2/3 

For comparing A and C, the effective no. of replications for A is 0 
+ i -hi +1 =3, and the effective replication for C is 2/3 + 1 + 2/3 + 1 = 
10/3. 


For comparing A and D, the effective number of replication for A 

2 2 7 

starting from 1 st row = 0 + — + — + 1 = — 

3 3 3 

The effective number of replication for 

2 2 7 
D = - + i + o + - = - 

3 3 3 


SE 


(A-D) 


If3 

3) 

1 - 

+— 

Vl7 

7 J 


2 

e 


□ □ □ 



Chapter - 8 


Factorial Experiments 



In a factorial or a multifactor experiment we study the effect of 
several factors simultaneously each factor having a number of levels. 
In these experiments we are interested in estimating and testing the 
treatment contrasts that define main effects and interaction effects of 
different factors. Consider the result of a trial conducted to measure 
the effect of nitrogen and phosphorus on the yield of rice crop. 
Suppose we have two levels (N 0 and of nitrogen and two level of 
phosphorus (P 0 and P^. The yield of rice crop in q/ha is given as: 


Rice Yield in q/ha 


Nitrogen 

Phosphorus 

0 kg N/ha (N 0 ) 

60 kg N/ha (N^ 

0 kg P 2 Os/ha (P 0 ) 

N 0 Po 

10.0 

N,P 0 

30.0 

30 kg P 2 C> 5 /ha (P^ 

N 0 P, 

20.0 

N,P, 

40.0 


Effect of nitrogen at P 0 level of phosphorus = 30 - 10 = 20.0 q/ha 


Effect of nitrogen at P 1 level of phosphorus = 40 - 20= 20 q/ha 

Effect of phosphorus at N 0 level of Nitrogen = 20 -10 = 10 q/ha 

Effect of phosphorus at M 1 level of Nitrogen = 40 - 30 = 10 q/ha 

As the effect of nitrogen (Phosphorus) is the same at both the 

levels of phosphorous (nitrogen), there is no interaction between 

nitrogen and phosphorus. If the yields of rice shows the following 
pattern. 






Factorial Experiments 


Rice yield in q/ha 


Nitrogen 

Phosphorus 

0 kg N/ha (N 0 ) 

60 kg N/ha (N^ 

0 ka P20 5 /ha (P 0 ) 

N 0 Po 10.0 

NiPo 30.0 

30 kg P 2 0 5 /ha (P^ 

N 0 Pi 20.0 

NiPi 50.0 


Effect of nitrogen at P 0 level of phosphorus = 30 -10 = 20 q/ha 

Effect of nitrogen at P ^ level of phosphorus = 50 - 20 = 30 q/ha 

Effect of phosphorus at N 0 level of Nitrogen = 20 -10 =10 q/ha 

Effect of phosphorus at Ni level of Nitrogen = 50 - 30 = 20 q/ha 

As the effect of nitrogen (Phosphorus) are not the same at both 
the level of phosphorus (nitrogen), nitrogen and phosphorus have 
interaction effect. The effects explained above are called simple 
effects and average of these simple effects is called main effects of 
the factor. 

For example for the second set of data 

Main effect of nitrogen = 20+30/2 = 25.0q/ha. 

Main effect of phosphorus = (10.0 +20.0)/2=15.0q/ha 

Interaction is the difference between simple effects, thus either 
30-20.0=10.Oq/ha or 20-10=1 Oq/ha is the measure of interaction 
effect between nitrogen and phosphorus. It interaction exits which is 
fairly common, we should plan our experiments in such a way that 
these can be to estimated and tested. It can not be done if we vary 
only one factor at a time for this purpose we must have multi-level, 
multifactor experiments. 

The experiment in which several levels (or stages) of two for 
more than two factors are tested in all possible combinations are 
called factorial experiments. Here the main effects and interactions 
are studied together. When the levels of all factors are equal, 
experiment is called symmetrical factorial experiment otherwise 
asymmetrical factorial experiment. 

In general, 2 n factorial experiments means n factor each at 2 
levels, 3 n factorial experiment means n factors each at 3 levels. The 
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treatment combination, can be laid out in Complete Randomized 
Design or Randomized Block Design depending upon the nature of 
the experimental material. 

Features of Factorial experiments 

(i) It can furnish information regarding the interaction between 
various factors. 

(ii) If the interaction between the factors is significant, the 
optimum combination of two can be discovered only through 
factorial combinations. 

(iii) If the two factors do not interact, we may speak of the 
response of one factor irrespective of the levels of the other 
factors. 

Symmetrical Factorial Experiment 

2 2 factorial experiment in which two factors A and B each at two 
levels 0 and 1 . There will be four treatment combination. 

00 = a 0 b 0 = 1: A and B both are at the first level. 

10 = a 1 b 0 = a: A at the second level and B at the first level. 

01 = a 0 bi = b: A at the first level and B at the second level. 

11 = a! b t = ab: A and B both at the second level. 

we denote the treatment combinations by small letters and 
treatment total by [ 1 ], [a], [b], [ab]. 

The comparison a-ibi-aobi estimates the effect of A when B is 
held constant at higher level. The comparison aibo-aob 0 estimates the 
effect of A when B is held constant at the lower level. The average of 

these two estimates is called main effect and the difference is called 
interaction, mathematically. 

A = \ +a,b 0 -a 0 b r aob 0 ) = I( a -i)(b+i) 

B = - (a! b, +a 0 b 1 -a 1 b 0 -a 0 b 0 ) = ^(a+1 )(b-1) 
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AB = j (a^ H-aobo-a^o-aob!) = -l(a-1)(b-1) 
How to compute main effects and interaction: 

2 2 Factorial Experiment 


In this notation, the following table shows how to compute the 
main effects and interaction from the treatment totals over ‘r’ 
replications for 2 2 factorial experiment. 


Factorial 

Effect 

Multiplier for 

Treatment total 

Divisor to 
get mean 

S.S. 


RIEIBHi 

2 r 


A 


2 r 


B 

-1-111 

2 r 

K9|H| 

AB 

1 -1-11 

2 r 



The rule to write down the signs of the main effects is to give a 
plus sign to the treatment combination containing the corresponding 
small letters and a minus sign where the corresponding small letter is 
absent. The signs of interaction are obtained by multiplying the 
corresponding signs of the two main effects. The main effect of A is- 

[A]/2r = [(ab) - (b) + (a) -(1) ]/2r 

The quantities [A], [B], .[AB] are called factorial effect totals. 

These four treatments combinations may be compared using 
either a CRD, RBD or LSD. The analysis will be same as that of the 
design considered. 

Analysis 

Let us have 2 2 factorial experiment laid in RBD with Y blocks. 
Thus, the four treatment combination shall be put in Y blocks 
randomly considering the treatment combination as individual 
treatments. Hence, the ANOVA table will look like as below: 

S.E. for testing the difference between means in case of p factor 

experiment 
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2 MSE 
n-p 



V p = 1,2....n 


Anova 


Source 

d.f. 

S.S. 

M.S. 

F 

Blocks 

r-1 

SSB 

SSB/r-1 


Main effect A 

1 

1 

MSA 

MSA/MSE 

Main effect B 

1 


MSB 

MSB/MSE 

Interaction AxB 

1 

I83Z9H1 

MSAB 

MSAB/MSE 

Error 

3(r-1) 

by 

subtraction 

MSE 


Total 

4r-1 

X>' 2 -c.f. 

U 



S.E. main effect = 

MSE , 

-, here n=2, p=1 

r 


S.E. for interaction AB = 


I2.MSE 


as n=p=2 


Critical differences are obtained by multiplying SE by students t 
value at a level of significance at error degree of freedom. 


Yates Method for computing factorial effect totals 

Yates gave a method to compute various effect totals for any 2 n - 
experiment. 

Step-1: Write down treatment combinations systematically in the 
1 st column starting with (1) and then writing a, b, ab and then 
combinations of these with new symbol c. Again the process is 
repeated with new symbol d and so on. 

Step-2: Write down treatment totals from all the replicates in the 
second column against appropriate treatment combination. 

Step-3: Break the values of the second column in to pairs 
starting from (1). In column third write down the sums of pairs in the 
first half while differences in the second half (The first number 
subtracted from second number) 
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Step-4: Again break the values of the third column obtained in 
step 3 into pairs and proceed as in step 3 to get column four and so 
on. 

Remarks: For a 2 n - experiment we are to repeat n- times the 
operations of column 3 and 4 and then the values in the (n+2)nd 
column will be factorial effect totals. The first entry will always be 
grand total. 


Yates method for 2 2 - experiment 


Treat. 

Comb. 

(D 

Total 

(2) 

Col. (3) 

Col. (4) 

Total 

effects 

S.S. due 

to 

different 

effect 

(1) 

[1] 

[IMa] 

[11+[a]+[b]+!ab] 

G 

G 2 /r.2 n 

a 

[a] 

[b]+[ab] 

[a]-[1]+[ab]-[b] 

A 

A 2 /r.2 n 

b 

[b] 

[a]-[ 1 ] 

[b]+[ab]-[1 ]-[a] 

B 

B 2 /r.2 n 

ab 

[ab] 

JabHbl 

[ab]-[b]-[a]+[1] 

AB 

(AB) 2 /r.2 n 


The analysis of variance can be formed. The procedure is 
illustrated with an example. 


Example: 2 2 - experiment in 6 randomized blocks was conducted 
in order to obtain an idea of the interaction; spacing x number of 
seedlings per hole, along with the effects of different types of spacing 
and different numbers of seedlings per hole, while adopting the 
Japanese method of cultivation of rice. The levels of two factors are: 

S : 8” and 10” spacing N : 3 and 4 seedlings per hole. 


The field plan and yield is as follows: 



Analyse the data to find out if there is any significant treatment 
effects? 
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Computations: We apply Yates method to find total effects 


Treat. 

Comb. 



S.S. 


(1) 

610 

1172 

2442 

GT 

(2442) 2 /24=24847.35 

n 

562 

1270 

-104 

N 


s 

663 

-48 

98 

S 

innm 

ns 

607 

-56 

-8 

NS 

(-8) 2 /24 =2.667 


CF = (2442)724 = 24847.35 

ANOVA can be set as follows: 


Source 

Blocks 

N 

S 

NxS 

Error 

Total 


S.S. 


6270.5 

450.667 
400.167 

2.667 

3426.5 

10550.5 


M.S. 

1254.1 

450.667 
400.167 

2.667 
228.433 


1.973 

1.752 

N.S. 

N.S. 

N.S. 


F(.o 5 ;i.i 5 ) tabulated = 4.54 
Result: There are no significant main or interaction effects 
present in the above experiment. 


2 3 — experiment 

If we consider three factors A, B and C each at two levels, then 
there are 8 treatment combinations denoted by (1), a, b, ab, c, ac, be, 
abc. These 8 treatment combinations may be compared in any of the 
designs CRD, RBD or LSD having the same analysis as if there are 8 
treatments. 

In a three factor experiment there are three main effects - A, B, 
C; three first- order interactions - AB, AC, BC; and one second-order 
(or three factor) interaction-ABC. Here main effect A is defined as 






















Factorial Experiments 


117 


A= — [(abc) - (be) + (ac) - (c) + (ab) - (b) + (a) - (1) J 

or A = l( a -1)(b + 1)(c + 1) 

B = l( a + i)(b-1)(c + 1) 

4 

AB = i (a -i)(b-1)(c + 1) 

4 


Alternatively we can also get the expression for effects by using 
Table of signs as follows: 


Effect 

<n_ 

(a) 

(b) 

(ab) 

(c) 

(ac) 

(be) 

(abc) 

Divisor 

M 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

8 

< 

- 

+ 

- 

+ 

_ 

+ 


+ 

4 

B 

- 

- 

+ 

+ 

. 

_ 

+ 

4- 

4 

C 

- 

- 

- 

- 

+ 

+ 

+ 

+ 

4 

AB 

+ 

- 

- 

+ 

+ 

- 

- 

+ 

4 

AC 

+ 

- 

+ 

- 

- 

+ 

- 

+ 

4 

BC 

+ 

+ 

- 

- 

- 

- 

+ 

+ 

4 

ABC 

- 

+ 

+ 

- 

+ 

- 

- 

+ | 

4 1 


Or we can also use Yates method for obtaining the total effects 
and then proceed for the ANOVA and test the hypothesis. 


Example of 2 3 factional experiment 

The following table gives the yield of factorial experiment 
involving 3 factors A, B, C each at two levels laid out in RBD using 
three replication alongwith total effects obtained by Yates technique. 
Analyse the data and draw conclusions. 

C.F. = (132) 2 /24 = 726.0 

( 2 ) Total sum of square (TSS) = ( 2) 2 + (3) 2 .. +(14) 2 - 726.0 = 240 

(3) Sum of square due to Blocks = [(37 ) 2 +(42) 2 + (53) 2 ]/8-776=16.75 

(4) Sum of square due to treatments = 212.68 

(5) Sum of square due to error = 240-212.6 - 16.75 = 1057 


L 
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Table 


1 

Bloc 

ks 


2 

3 


5 

Total 

effects 

S.S. due to 
diff. effects 

Treatment 

Bi 

B2 

B 3 

Total 




J13_ 

2 

3 


8 

17 

44 

132 

Lg_ 

726 

a 

3 

2 

B 

9 

27 

88 

32 

ISH 

1 

b 

2 

a 

5 

7 

31 

14 

36 

RUB 

BUB 

ab 

5 

B 

8 

20 

57 

18 

24 

RUB 


c 

5 

B 

5 

14 

B 

10 

44 

ISH 


ac 

m 

6 

7 

17 

13 

26 

m 

RGB 

4 2 /24=0.58 

be 

6 

BI 

8 

21 

3 

12 

16 

lUSBi 

16 2 /24=10.70 

abc 

10 

12 

14 

36 

15 

12 

0 

IJfBI 

02/24=0.0 

Total 

37 

42 

53 





HI 

212.68 


Anova table 


Source 

d.f. 

s.s. 

M.S. 

F 

Blocks 

2 

16.75 

8.34 

12.69** 

T reatments 

A 

1 

42.70 

42.70 

64.70** 

B 

1 

54.00 

54.00 

81.82** 

AB 

1 

24.00 

24.00 

36.36** 

C 

1 

80.70 

80.70 

122.27** 

AC 

1 

0.58 

0.58 

0.88 

BC 

1 

10.70 

10.70 

16.21*** 

ABC 

1 

0 

0 

0 

Error 

16 

10.57 

0.66 


Total 

23 

240.00 




The main effects and the interaction differ significantly except AC 
and ABC. CD values can be calculated in the usual way as in the 
RBD. 


Disadvantage 

When the no. of factors or their levels are large it would result in 
larger no. of treatment combinations which would require blocks of 
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larger size. This would result in heterogeneity with in block and the 
precision of the experiment will be low. Under such situations 
confounded designs are adopted in which treatments/replicates are 
divided into two or more blocks of small size and treatments are 
randomized in each block separately resulting in increasing precision. 

In general ANOVA for 2 n experiment is _ 


Source 

d.f. 

Replication 

r-1 

T reatment 

2 n -1 

(i) Main effects 

n 

(ii) Two factor interaction 




(iii) Third factor interaction 


f n ] 


(iv) n factor interaction 


'"I 


Error 

(2 n -1)(r-1) 

Total 

r2 n -1 


Asymmetrical factorial experiments 

Asymmetrical factorial experiments are those in which all the 
factors do not have equal no. of levels. These find wide applications 
in agricultural, industrial and animal science experimentations for 
testing several factors at unequal no. of levels. 

Two factors asymmetrical factorial experiment 

Layout: Consider a two factors factorial experiment with 2 
varieties and 3 fertilizers laid in R.B.D. with 5 replications. The six 
possible treatment combinations namely v^, v^, v^, v 2 fi, v 2 f 2 and 
v 2 f 3 are randomized in each replicate separately as follows: 








R 


Vlfl 

117 


V 2 f 3 

74 


Vlf 2 

139 


v 3 fi 

35 


V2^2 

44 


Vlf 3 

129 



v 2 f 3 

155 


v 2 f 2 

71 
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R 3 


Vif 2 

146 


Vif 3 Vif i 

125 112 


v 2 f 3 v 2 f 3 

62 76 


Vif 2 v 2 f 2 

138 55 


v 2 fi 

36 147 


Vlfl V 2 f! 

133 42 


Record the raw data in the layout plan itself as shown above and 
rearrange the data in following date sheet for further analysis. 


T reatment 


Vifl 


Total 





Prepare v x f table of total from totals of treatment combinations 


Total 


Total 


Vi 

V 2 

Vifl 

v 2 fi 

Vlfl 

v 2 f 2 

Vif 3 

v 2 f 3 

V, 

v 2 



So we have Ri, R 2 .R 5 are replication totals 

V 1f V 2 ...are varieties totals 

Fi, F 2 , F 3 .treatments totals 

IRi = ZVj = ZF k = ZY = G = Grand total 
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N = rvf = 5x2x3 = 30 = Total no. of observations 
Correction factor (C.F.) = G 2 /n 


ANOVA Table 


Source 

d.f. 

S.S. 

M.S. 

F 

Replications 

r-1 =4 

IR, 2 /vf-C.F. 

R 


Varieties 

v — 1 = 1 

EV 2 /rf-C.F. 

V 

V/E-Fv.t, error d.f. 

Fertilizers 

f-1 =2 

EF k 2 /rv-C.F. 

F 

F/E-Fm, error d.f. 

Variety x 

(v-1) (f-1) 

I(v,f k )2/r-CF 

VF 

VF/E-F (v-i )(f-i) t 

Fertilizer 

= 2 

SS(V) 

SS(F) 


error d.f. 

Error 

(f v-1) (r-1) 

(by 

subtraction) 

E 


Total 

rfv-1 

IY 2 - CF 




Using F test significance of each component namely variety, 
fertilizer and their interaction variety x fertilizer can be tested against 
error as shown above, 


SE of difference for varieties = 



SE of difference fertilizers = 



SE of difference for V x F = 



Now C.D. value can be calculated by multiplying the SE d with t (H> error 
d .t.) value 


For testing the levels for different factors and interaction, prepare 

V x F table of means by dividing each events of V x F table of total by 
r and compare the various factors using respective 5% C.D. 

V x F mean table 



Vi 

V 2 

Totals 

f, 

Vifi/r 

v 2 f!/r 

f 1 



_ 


_f 3 

- 

v 2 f 3 /r 

1 f3 

JTotals 

Vi 

v 2 

i 
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Example: Give complete analysis of factorial experiment (given 
in layout) on 2 varieties, 3 fertilizers conducted in RBD with 5 
replications. 

Solution: 


Arrange the data in following datasheet and find marginal totals. 

Replications 


Treatment 

1 

II 

mm 

IV 

V 


Vifi 

117 

143 

133 

112 

93 

\mm 

Vif 2 

139 

171 

138 

146 

122 

716 

Vif 3 

129 

155 

125 

147 

137 

693 

v 2 f. 

35 

48 

36 

42 

22 

183 

v 2 f 2 

44 

71 

61 

55 

54 

285 

v 2 f 3 

74 

97 

62 

76 

48 

357 

Total 

538 

685 

555 

578 

476 

2832 


Now prepare V x F table of totals from above treatment totals 
and find totals on both sides 



Vi 

v 2 

Totals 

fi 

598 

183 

781 

f 2 

716 

285 

1001 

f 3 

693 

357 

1050 

Totals 

2007 

825 

2832 


We have r =5, v=2, f=3, t=vf=6, N=rvf=30 G=2832 


C.F. = G 2 /N = = 267340.8 

30 


SS due to replicates = 


ZR i 2 

vf 


-C.F. 


(538) 2 + (685) 2 +.(476) 2 

6 


-267340.8 =2689.1 


SS due to varieties = 



C.F. 
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_ (2007) 2 + (825)' 
15 


-267340.8 = 46570.8 


TFk 2 

SS due to fertilizers =-C.F. 


rv 


= (781) 2 + (10Q1) 2 + (1050)' 


10 


-267340.8 =4105.4 


S.S. due to interaction V x F = 


I(vjfk) 2 

r 


C.F. - SS(V) - SS(F) 


(598) 2 + (716) 2 + ... + (357) 2 
5 


267340.8 


-46570.8 - 4105.4 = 517.4 

Total SS = ZY 2 -C.F.=(117) 2 + (139) 2 +.(48) 2 - 267340.8 


= 56675.2 

ANOVA Table 


Source 

df 

S.S. 

M.S. 

F 

IEEH333MI 

r-1=4 

2689.1 

672.28 

4.8 

Varieties 

v-1 =1 

46570.8 

46570.80 

333.5* 

Fertilizer 

f-1=2 

4105.4 

2052.73 

14.7* 

Varity x 
Fertilizer 

(v-1) (f-1) 

= 2 

517.4 

258.72 

1.9 

Error 

20 

2792.5 

139.63 


Total 

rvf-1 = 29 

56675.2 




F(.os. i. 20 ) - 4.35, F( os. 2 , 20 ) = 3.49 

Since for both varieties and fertilizer, F ca) > F tabulated, so they 
differ significantly, while their interaction is non significant. 


C.D. for varieties at 5% = /—x t 0 = - x 139,63 x 2.09 

V rf V 15 


= 4.13x2.09 = 9.01 



(2x139.63 


x 2.09 


C.D. for Fertilizer at 5%= 


10 
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= 5.28 x 2.09 = 11.04 — 


Now from VxF table of totals prepare VxF mean table by dividing 
each entry in table by 5 and taking means on both sides. 



Vi 

V 2 

Means 

fl 

119.6 

36.6 

78.1 

f2 

143.2 

57.0 

100.1 

f3 

138.6 

' 71.4 

105.0 

Means 

133.8 

55.0 



CD at 5% for V = 9.01, CD at 5% for F = 11.04 CD for VF = N.S. 

Remarks 


I. C.D. should not be worked out for the components indicated non 
significant by F-test in ANOVA table and their means should not 
be compared otherwise misleading results are obtained. 

II. Take 1/F instead of F value when F ca i for a component in less 
than 1. Also interchange the d.f. of F for testing the significance. 

Three factor asymmetrical factorial experiment: 

It is an extension of two factor factorial experiments. 

Layout: Consider a three factor factorial experiment with 2 
varieties (vi,v 2 ), 3 fertilizers (h, f 2 , fa) and two pesticides (p 1t p 2 ) laid in 
RBD with r = 4 replications. 

Total number of treatment combination t = vfp = 2x3x2 = 12 

Total number of observation N = rvfp = 48 

The experiment is conducted in RBD with following 12 treatment 
combinations. 


tl=VifiPi 

t 7 =V 2 f 1 Pi 

t 2 =Vifip 2 

ts=V 2 f 1 p 2 

t 3 =Vif 2 Pi 

tg=V 2 f 2 p 1 

t4=V^ 2 p 2 

flO = V 2 f 2 p 2 

ts=Vif 3 Pi 

fl1 = V 1 f 3 p 1 

f6 = V 1 f 3 p 2 

ti 2 =V 2 f 3 p 2 







Analysis 


The raw data from the layout plan for further analysis be 
arranged in the following data sheet___ 


Treatments _ 

Ri 

r 2 

r 3 

R4 

Total 

VifiPl 





VifiPi 

ViflP2 





VifiP 2 


— 




v 2 f 3 p 2 











Vif 3 P 2 





V2flPl 





V?fiP2 















VifaPl 





V 2 f 3 P2 





Total 

Rt 

r 2 

r 3 

R4 



From the total of various treatment combinations construct PxV, 
VxF and FxP tables of totals 


PxV 

♦ 

VxF 


FxP 



Vi 

V 2 

Total 

■ 

n 

■a 

f 3 

Total 


Pi 

p 2 

Total 

_E2_, 

EH 


Pi 

Vi 

Vlfl 

Vlf2 

Vif 3 

Vi 

n 

ma 

ns 

Fi 

_J2__ 

m 


P 2 

V 2 

v 2 f 1 

v 2 f 2 

v 2 f 3 

V 2 

mm 



f 2 

Total 

Vi 

v 2 


■ 

Fi 

f 2 

F 3 


f 3 



f 3 





■ 





Total 

..—.1;, . 1 

Pi 

p 2 



Hence we have R 1f R 2 , R 3 , R 4 as replication totals 
V 1f V 2 are varieties totals 
Pi> P 2 , F 3 , are fertilizer totals 
Pi, P 2 are pesticide totals 


£Rj - IVj = XF k = ZP e = G = Grand total 
N = r v f p = Total number of observation 
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Correction factor (C.F.) = G 2 /N 

ANOVA Table 


Source 


Replications r-1 =3 


Varieties v-1=1 


Fertilizer f-1=2 



^—CF 
rvp 



££ ^-CFSSM-SStF, 


Pesticides ! p-1=1 


Variety x (v-1) (p-1) 1 II(VjPi)—OF - SS(V) - SS(P) 
pesticides rf _ 

Fertilizer x (f-1) (p'1) = 2 —CF- SS(F)-SS(P) 

pesticides _ rv 


VxFxP (v-1) (f-1) 

(p-1) = 2 



VP VP/E 


FP FP/E 


ZZ(v ,f k P| )- 


- CF - SS(V) - SS(F) 


VFP VFP/E 



Total 


By 

subtraction 


rvfp-1 


-SS(VxF)-SS(P)-SS(VxP)-SS(FxP 


By subtration 


Zy 2 -CF 




All the main effects and interactions are tested against error E 
using F-test with their respective d.f. 

5% CD for any main effect or interaction is given by ^2E/. x t 0 

Where r,* = effective no. of replications for that component and is 

equal to the divisor of respective component of S.S in the ANOVA 
table. 

mean^JdWH' C T PariS ° n PrSpare PxV ' VxF and FxP tables of 
by dividing the entries in table of totals by r,* where ri* is the 
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The significance of difference between varieties, fertilizers, 


pesticides and their interaction can be tested from above mean 


tables using the corresponding C.D. values. 


Experiments with factors each at three levels 

When factors are taken at three levels instead of two, the scope 
of an experiment increases. It becomes more informative. A study to 
investigate if the change is linear or quadratic is possible when the 
factors are at three levels. The more the number of levels, the better, 
yet the number of the levels of the factors cannot be increased too 
much as the size of the experiment increases too rapidly with them. 
Let us begin with two factors A and B, each at three levels say 0, 1 
and 2(3 2 -factorial experiment). The treatment combinations are 

00 = a 0 b 0 = 1 ; A and B both at first levels 

10 = aib 0 = a ; A is at second level and B is at first level 

20 = a 2 b 0 = a 2 ; A is at third level and b is at first level 

01 = aobi = b ; A is at first level and B is at second level 

11 =a 1 b 1 = ab ; A and B both at second level 

21 = a 2 b! = a 2 b ; A is at third level and B is at second level 

02 = a 0 b 2 = b 2 ; A is at first level and B is at third level 

12 = a^ = ab 2 ; A is at second level and B is at third level 

22 = a 2 b 2 = a 2 b 2 ; A and B both at third level 

Any standard design can be adopted for the experiment. 

The main effects A, B can respectively be divided into linear and 
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quadratic components each with 1 d.f. as A L , A Q , B L and B Q . 
Accordingly AB can be partitioned into four components as A L B L , 
Ai_Bq, AqBl, AqBq. 


The coefficients of the treatment combinations to obtain the 
above effects are given as 


Treatment 

totals 

Factorial 

Effects 

[1] 

[a] 

[a 2 ] 

[b] 

[ab] 

[a 2 b] 

[b 2 ] 


[a 2 b 2 ] 

Divisor 

M 

+ 1 

+1 

+1 

+1 

+ 1 

+ 1 

+1 

+ 1 

+ 1 

9r = rx3 2 

Al 

m 

0 

+1 

-1 

0 

+ 1 

n 

0 

+ 1 

6r = rx2x3 

Aq 

+i 

-2 

+1 

+1 

-2 

+1 

+i 

-2 

+1 

CO 

II 

00 

Bl 

n 

-1 

H 

0 

0 

0 

+i 

+1 

+1 

6r = rx2x3 

AlBl 

+i 

0 

-1 

0 

0 

0 

m 

0 

+1 

4r = rx2x2 

AqBl 

-i 

+2 

n 

0 

0 

0 

+i 

-2 

+1 

12r = rx6x2 

Bq 

+i 

+1 

+i 

-2 

-2 

-2 

+i 

+1 

+1 

18r = rx3x6 

AlBq 

n 

0 

+i 

+2 

0 

-2 

n 

0 

+1 

12r = rx2x6 

AqBq 

+i 

-2 

+i 

-2 

+4 

-2 

+i 

-2 

+1 

36r = rx6x6 


The rule to write down the coefficients of the linear (quadratic) 
main effects is to give a coefficient as +1 (+1) to those treatment 
combinations containing the third level of the corresponding factor, 
coefficient as 0 (-2) to the treatment combinations containing the 
second level of the corresponding factor and coefficient at -1(+1) to 
those treatment combinations containing the first level of the 
corresponding factor. The coefficients of the treatment combinations 
for two factor interactions are obtained by multiplying the 
corresponding coefficients of two main effects. The various factorial 
effect totals are given as 

[A L ] =+1[a 2 b 2 ] +0[ab 2 ] -1 [b 2 ] +1[a 2 b] + 0[ab] -1[b]+1[a 2 ] + 0[a] -1[1] 

[Ad =+1 [a 2 b 2 ] -2[ab 2 ] +1 [b 2 ] +1[a 2 b] -2[ab] +1[b] +1[a 2 ] -2[a] +1[1] 

[BJ =+1[a 2 b 2 ] +1[ab 2 ] +1 [b 2 ] +0[a 2 b] + 0[ab] +0[b] -1 [a 2 ] -1 [a] -1 [1] 
[A l B l ] =+i[a 2 b 2 ] +0[ab 2 ] -1 [b 2 ] +0[a 2 b] + 0[ab] +0[b] -1 [a 2 ] + 0[a] -1[1] 
[AqB l ] =+1 [a 2 b 2 ] -2[ab 2 ] +1 [b 2 ] +0[a 2 b] + 0[ab] +0[b] -1[a 2 ] + 2[a] -1[1] 
[B q ] =+1[a 2 b 2 ] +1 [ab 2 ] +1 [b 2 ] -2[a 2 b] -2[ab] -2[b] -1[a 2 ] -1[a] -1[1] 






























































































Factorial^ Experiments _ 129 

[A l Bq] =+1[a 2 b 2 ] +0[ab 2 ] -1 [b 2 ] -2[a 2 b] + 0[ab] +2[b] +1[a 2 ] + 0[a] -1[1] 
[A q B q ]=+1 [a 2 b 2 ] -2[ab 2 ] +1 [b 2 ] -2[a 2 b] + 4[ab] -2[b] +1[a 2 ] -2[a] +1[1] 
Factorial effects are given by 


A l = [AJ/r. 3 A q = [Ad/r. 3 B L = [BJ/r.3 A L B L = [A L B L ]/r.3 
A Q B L =[A Q B L ]/r.3 B Q = [B Q ]/r.3 A L B Q = [A L B Q ]/r.3 A Q B Q = [A Q B Q ]/r.3 
The sum of squares due to various factorial effects is given by 


ssa l J a l1 2 

r.2.3 

ssa q = [Aq]2 

r.6.3 

ssb l .^ 

L r.3.2 

SSA l B l 

_[a l b l i 2 

r.2.2 

ssa q b l - [A q b lJ 2 

r.6.2 

Ssb - [B o] 2 
B °'" r.3.6 

ssa l b q . [A ^ B ° )2 

L Q r.2.6 

SSAqBq 

_ [AqBqJ 2 
r.6.6 


If a randomized complete block design is used with r-replications 
then the outline of analysis of variance is 


Anova 


Source of 
variation 

degree of 
freedom 

S.S. 

M.S. 

Between 

replications 

r-1 

SSR 

MSR=SSR/(r-1) 

Between 

treatments 

3^—1 =8 

SST 

MST=S ! ST/8 

A 

2 

SSA 

MSA=SSA/2 

a l 

1 

| 


Aq 

1 

E0! 


B 

2 

SSB 

MSB=SSB/2 

b l 

[T 

ssb l 

msb l = ssb, 

Bq 

i 



AB 

4 

SSAB 

MSAB=SSAB/2 

A l B l 

1 


MSA l B, =SSA l B, 

AqB l 

1 

E&2S3M 

MSAqB l =SSAoB, 

AlBq 

1 

SSA l Bq 

MSA L B a =SSA, Bn 

AqBq 

1 

— - - -- - 





SSE 

KSbiSI 

Total 

r.3 -1=9r-1 

TSS 



If the factors are quantitative in nature, retain the SS with single 
d.f.. However, if the factors are qualitative in nature, then this 
partitioning of SS of a particular factorial effect with p d.f. can only be 
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considered &s partition of its SS into single d.f. components by writing 
'p' linearly independent contrasts and these sum of, squares with 
single d.f. can be added to obtain the SS due to corresponding 
factorial effect totals with p d.f. 


In general, for n factors each at 3 levels, the sum of squares due 
to any linear (quadratic) main effect is obtained by dividing the square 
of the linear (quadratic) main effect total by r.2.3 n ‘ 1 (r.6.3 11 ' 1 ). Sum of 
squares due to a ‘p’ factor interaction is given by taking the square of 
the total of the particular interaction component divided by r.^ a 2 ... 

a p ). 3 n ' p , where a 1t a 2 ,., a p are taken as 2 or 6 depending upon 

the linear or quadratic effect of particular factor. 

Exercise 2: A 3 2 experiment was conducted to study the effects 
of the factors Nitrogen (N) and Phosphorus (P) (each at three levels 
0,1,2) on sugar beet. Two replications of nine plots each were used. 
The table below shows the plan and the percentage of sugar 


(approximated to nearest whole number). 


Replication 

T reatment 

N P 

% of sugar 

1 

0 

1 

14 



2 

0 

15 



0 

0 

16 



2 

1 

15 



0 

2 

16 



1 

2 

18 



1 

1 

17 



1 

0 

19 



2 

2 

17 


II 

1 

2 

20 



1 

0 

19 



1 

1 

17 



0 

0 

18 



2 

1 

19 



0 

1 

16 



0 

2 

16 



2 

2 

19 



2 

0 

16 
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Analysis 


Step 1: To find sum of squares between replications, between 
treatments and total sum of squares arrange the data in a Replication 
x Treatment combinations table as follows: 


Repl. 

Ti 

reatment Coml 

binations 

Total 


1 

n 

n 2 


n p 

HUH 

wm 

1 




00 

10 

20 

01 

11 

21 

02 

12 

22 


1 

16 

19 

15 

14 

17 

15 

16 

■ 

17 

151 

2 

18 

19 

16 

16 

17 

19 

16 

20 

19 

1 

Total 

34 

38 

31 

30 

34 

34 

32 

38 

36 

307 


lull 

(T 2 ) 

nzn 

Kil 

Kil 

Id 

Id 

Id 

Id 

wm\ 


Grand Total: 307 

No. of observations (n) = r. 3 2 =18 

Correction Factor (C.F.)= (307) 2 /18 =5236.0556 

Total S.S. (TSS) = Sum (observation ) 2 - C.F. 

= 16 2 + 18 2 +.+17 2 +19 2 -5236.0556 


= 48.9444 




Replication SS (SSR) 


147 2 + 160 2 


-5236.0556 = 9.3888 


__ Sumftreatment totals ) 2 _ 

Treatment SS (SST) =- 1 ---'—C. 


34 2 + 38 2 .+ 38 2 + 36 2 


-5236.0556 =32.4444 


Error SS = Total SS - Replication SS - Treatment SS 
= 7.1112 

Step2: Obtain various factorial effects total 
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[NJ =+1[n 2 p 2 ] +0[np 2 ] -1 [p 2 ] +1[n 2 p] + 0[np] -1[p]+1[n 2 ] + 0[n] -1[1] 


= 5 

[N q ] =+1[n 2 p 2 ] -2[np 2 ] +1 [p 2 ] +1[n 2 p] -2[np] +1[p] +1[n 2 ] -2[n] +1[1] 


= -23 

[PJ =+1[n 2 p 2 ] +1[np 2 ] +1 [p 2 ] +0[n 2 p] + 0[np] +0[p] -1 [n 2 ] -1[n] -1 [1] = 3 
[N l P l ] =+1 [n 2 p 2 ] +0[np 2 ] -1 [p 2 ] +0[n 2 p] + 0[np] +0[p] -1[n 2 ] + 0[n] -1[1] 
= 7 

[N q PJ =+1[n 2 p 2 ] -2[np 2 ] +1 [p 2 ] +0[n 2 p] + 0[np] +0[p] -1[n 2 ] + 2[n] -1[1] 


= 3 

[P Q ] =+1[n 2 p 2 ] +1[np 2 ] +1 [p 2 ] -2[n 2 p] -2[np] -2[p] -1[n 2 ] -1[n] -1[1 ] =13 
[N l P q ] =+1 [n 2 p 2 ] +0[np 2 ] -1 [p 2 ] -2[n 2 p] + 0[np] +2[p] +1[n 2 ] + 0[n] -1[1] 


=-7 


[NqPq] =+1[n 2 p 2 ] -2[np 2 ] +1 [p 2 ] -2[n 2 p] + 4[np] -2[p] +1[n 2 ] -2[n] +1[1] 
=-11 

Step 3: The sum of squares due to various factorial effects is 
given by 

|2 m2 

' L _ r.2.3 ~ 12 

SSNo ,M..rai,,4. 694 4 

° r.6.3 36 

SSP L = = — = 0.7500 

l_ „ o n -4 0 


SSNi =^ = M1 = 2.0833 


r.3.2 12 

ssN L P L =^=mi =6 . 


r.2.2 


8 


1250 


ssn q p l = B£Jl = 1211 

r.6.2 24 

_ Pq] 2 [13] 2 
r.3.6 36 


SSP Q = 


0.375 

4.6944 
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ssn l p q = 


[N l Pq] 2 

r.2.6 


[-7] 2 

LJ - = 2.0417 
24 


ssn q p q = 


Mi]! 

72 


= 1.6806 


Step 4: Construct the ANOVA table as given above and test the 
significance of the various factorial effects. 


Anova 


Source of 

variation 

degree of 
freedom 

S.S. 

M.S. 

■ ■ - -- - - 

F 

tabulated 

Between 

replications 

1 

9.3888 

9.3888 

10.5623* 

Between 

treatments 

8 

32.4444 

4.0555 

4.5624* 

N 

2 

16.7774 

8.3887 

9.4371* 

N l 

1 

2.0833 

2.0833 

2.3437 

Nq 

1 

14.6944 

14.6944 

16.5310* 

P 

2 

5.4444 

2.7222 

3.0624 

Pl 

1 

0.7500 

0.7500 

0.8437 

Pq 

1 

4.6944 

4.6944 

5.2811 

NP 

4 

10.2223 

2.5556 

2.875 

n,p l 

1 

6.1250 

6.1250 

6.8905* 

NqP|_ 

1 

0.3750 

0.3750 

0.4219 

n,Pq 

1 

2.0417 

2.0417 

2.2968 

NoPo 

1 

1.6806 

1.6806 

f 

Error 

8 

7.1112 

0.8889 

| 

Total 

17 

48.9444 




(* indicates the significance at 5% level of significance) 


□ □ □ 































































Chapter - 9 


Confounding 



When in a factorial experiment, the number of treatment or the 
levels of the treatment increases, the size of the block increases, 
resulting in heterogeneity of soil within the block which contributes to 
the increase in experimental error. For example, 2 5 factorial 
experiments would have 32 treatment combinations and blocks of 32 
plots are quite big to ensure homogeneity within them. A new 
technique is therefore necessary for designing experiments with a 
large number of treatments and one such device is to take blocks of 
size less than the number of treatments and have more than one 
block per replication. The division of complete block in to incomplete 
blocks is done in such a way that certain interaction effect (preferably 
higher order interactions) would be made identical to the incomplete 
block comparison. In such cases interaction effect is known to be 
confounded with block effects. This device of reducing the block size 
by taking one or more interaction contrasts identical with block 
contrasts is known as confounding. Preferably, only higher order 
interaction, that is, interactions with three or more factors are to be 
confounded because their loss is immaterial, as an experimenter is 
generally interested in main effects and two factor interactions, these 
should not be confounded as far as possible. 


When there are two or more replications, if the same set of 
interactions are confounded in all the replications, confounding is 
said to be complete and if a different set of interactions are 
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confounded in different replications, confounding is said to be partial. 
In complete confounding all the information on the confounded 
interactions are lost. But in partial confounding information on the 
confounded interactions can be recovered from those replications in 
which they are not confounded. 

For example consider 2 3 factorial experiment let the three factors 
are A, B and C each at two levels. 


Let us divide the 8 treatments combinations in two blocks of size 4. 



Here the Block effect (B 2 - BJ= ( a bc)+(a)+(b)+(cH1)-(ab)-(ac)- 
(bc) is identical with the interaction ABC = (a-1) (b-1) (c-1). 


Here we say ABC effect is confounded or mixed up with the 
block effect and can not be isolated from the block effect. 

Advantages of confounding 

If reduces the experimental error considerably by stratifying the 
experimental material in to homogeneous groups. 

Disadvantage of confounding 

(1) Here the increased precision is obtained at the cost on certain 
relativity unimportant interactions. 

(2) Confounded contrasts are replicated fewer times than are the 
other contrasts and as such there is loss of information on them 
and they can be estimated with a lower degree of precision as 
the number of replication for them is reduced. 

Complete Confounding: 

Let us have three factors A, B, C each at two levels. ABC 
interaction is confounded in all the replications. Layout of 2 3 







experiments in blocks of 4 units where ABC is completely 
confounded in all the three replications. 


Rep 

Blocks 


R-l 

Bi_B2 


ab 

b 

ac 

c 

be 

abc 

(i) 

a 


R-ll 

B 3 _B 4 


ac 

c 

ab 

b 

(i) 

a 

be 

abc 


R-lll 

B 5 B fi 


(i) 

abc 

ab 

a 

be 

c 

ac 

b 


For all the three replicates B 2 - = B 4 - B 3 = B 6 - B 5 = ABC 

interaction. Hence ABC is confounded in all the replicates as ABC 
effect is equal to Block effect in each replication. 


Analysis 

For carrying out the statistical analysis, the various factorial 
effects and their S.S. are estimated in the usual manner using Yates 
technique of the factorial experiments with the modification that for 
completely confounded interactions neither the S.S. due to 
confounded interaction is computed nor it is included in the ANOVA 
table. The confounded component is contained in the (2p-1) d.f. (in 
case of p replicates) due to blocks. The partitioning of the d.f. for 2 3 

completely c onfounded factorial is as follows: _ 

Source of Variation_Degrees of Freedom 


Blocks 

2 p-1 

A 

1 

B 

1 

C 

1 

AB 

1 

' AC 

1 

BC 

1 

Error 

_6(p-1)__ 

Total 

8 p-1 

In general for a (2 n , 2 k ) completely confounded factorial in p 
replications, the different d.f.’s are given as follows: 
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Source of Variation 

Degrees of Freedom 

Replication 

P-1 

Blocks within Replication 

i 

* 

c 

w 

Q. 

T reatment 

1 

« 

c 

i 

i 

c 

C\J 

Error 

By subtraction 

Total 

p2 n -1 


The treatment d.f. has been reduced by 2 nk -1 as this is the total 
d.f. confounded per replication. 


Partial Confounding 

Here we confound different effects in different replications, so 
that we can get information on each effect from the replications in 
which the effect is not confounded. Here the effects are said to be 
partially confounded. 

Example 

R -i R-ll R-lll 



ABC is confounded AC is confounded AB is confounded 


General method of construction of confounded Designs in 2 n 
factorial experiments: 

Let the design is (2 n , 2 k ) i.e. 2 n treatment combinations arranged 
in 2 k plots per block. 

Treatment combinations = 2 n . 

Block size = 2 k 

Number of blocks per replication = 2 n k 
Total number of interaction confounded = 2 n ' k -1 
Number of independent interaction confounded = (n-k) 
Generalized interaction confounded = (2 n k -1) - (n-k) 
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(i) Choose (n-k) independent effect to be confounded. In general 
these should be of higher interaction of lesser importance. Their 
generalized interaction should not contain main effects or two 
factor interactions. 

(ii) Choose k independent treatment combinations having even 
number of letters common with the (n-k) independent 
confounded effects. (0 is also taken as even number). Denote 
them by 1 , 2 , ..... k. 

(iii) To find out the other contents of the blocks take 1 x 2, 1 x 3 , 2 x 
3, 1 x 2 x 3 and delete a 2 b 2 etc. when so ever appears. The 
process will provide 2 k -1 treatment combination of the block. 

(iv) Add control ( 1 ) to complete the 2 k treatment combination of key 
block. This block will be known as key block. 

(v) Multiply the contents of the key block by a treatment combination 
which has not appeared in the key block. As done earlier delete 
a 2 b 2 etc. when so ever appear. This process will provide 2 k 
treatment combinations of the llnd block. 

(vi) Like wise multiply the contents of the first block by a treatment 
combination which has not appeared in 1st or llnd block earlier to 
obtain the contents of the third block. 

(vii) The process is repeated till all the desired blocks are obtained. 

Example 

Consider (2 4 , 2 2 ) factorial experiment in two replications. Let us 
confound (n-k) independent interaction in each replicate So 2 
independent interactions are confounded. 

Let in rep I ABC and AD are confounded. So the generalized 
ABC x AD= A BCD = BCD is also confounded. 

Let in rep II BC and ABD are confounded. The Generalized 
interaction BC x ABD = AB 2 CD = ACD is also confounded. 

No. of plots per replicate in = 2 4 ' 2 = 2 2 = 4 

Find k i.e. 2 independent treat combination which have even no. 
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of letter common with the confounded interaction ABC and AD, they 

c and abd and obtained the key block as given in the 
general method of construction of design. 


Rep-I 


(1) 

(2) 

(3) = (1x2) 

(4) 

be 


bed 


(1) 


abc 

abd 


ab 


abed 


bd 

acd 


ac 


ad 


cd 

(1) 


d 


c 


a 


Bi 


\b~ 2 


b 3 


b 4 


The other block can be obtained from the key block by 
multiplying the content of the key block by d, c and a respectively. 


Similarly the Block content of the Rep-ll can be obtained. Find 
out two independent treatment combination let ad and abc which 
have even number of letters common with the confound interactions 
BC and ABD. The key block and rest of the block can be formed in 
the same way as given in the general method of construction. 


(i) 

ad 


abd 


acd 


a 

(2) 

abc 


ac 


ab 


abed 

(3) = (1x2) 

bed 


cd 


bd 


be 

(4) 

(i) 


(b) 


c 


d 

L 

Bi 


b 2 


b 3 


b 4 


Partial confounding 

In case of partial confounding, the S.S. for all the effects, other 
than the confound one, are obtained as usual using Yates technique 
given in the factorial experiments. For confounded effects, since the 
effect is not confounded in all the replications, therefore we can 
estimate the effects confounded in one replication from the other 
replication in which it is not confounded. In (2 n ,2 k ) factorial 
experiment with p replications, following is the splitting of d.f.’s. 













The S.S. for confounded effects are to be obtained from those 
replications only in which the given effect is not confounded i.e. in 
calculating the S.S., the observation pertaining to the replication in 
which it is confounded is not considered. 

The S.S. due to confounded interaction is given by 

S.S. = [Z] 2 /(r-k*)2 n 

where Z is the interaction confounded and k* is the number of 
replication in which the given interaction is confounded. 


Example 

Analyse the following 2 3 Factorial-experiment in blocks of 4 plots, 
involving three fertilizers N, P, K each at two levels. 



Step 1. Identify the interactions confounded in each replicate. 
Here, each replicate has been divided into two blocks, one effect has 
been confounded in each replicate. The effects confounded are 
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Replicate I : NP Replication II : NK Replicate III : NPK 
Step 2: Obtain the C.F., blocks S.S. and Total S.S. 

CF. = (2088) 2 /24 = 181656 


S.S. due to Blocks = 
Total S.S. = I(Obs.) 2 


I -L-C.F. = 2506 

i = 1 4 

-C.F. = 8658 


Step 3: Obtain the sum of squares due to all the factorial effects 
other than the confounded effects using Yates technique. 


T reatment 

Combinations 

Total 

Yield 

Factorial 

Effects 

Sum of Squares 
(S.S.) = [ ] 2 /2 3 .r 

(1) 

255 

G=2088 


n 

223 

[N]=48 

96= Sf; 

P 

253 

[P]=158 

1040.17 =Sp 

np 

308 

[NP]=66 

- 

k 

232 

[K]=10 

4.17= S k 2 

nk 

255 

[NK]=2 

- 

pk 

280 

00 

1 

II 

* 

CL 

2.67= Sp k 

npk 

282 

Z 

“0 

II 

1 

o 

00 

- 


Step 3: Obtain the S.S. of the confounded interactions. The 
interaction NP is given by 


[NP] = [npk] + [np] + [k] + [1] - [pk] - [nk] - [p] -[n] = 92 

This total is obtained by excluding the observations in the first 
replicate as NP is confounded in this replicate. Similarly [NK] is 
obtained from replications I and III and [NPK] is obtained from 
replications I and II. 


[NK] = [npk] + [p] + [nk] + [1] - [pk] - [np] - [k] - [n] = -18 
[NPK]= [npk] + [p] +[k] + [n] - [np] - [nk] - [pk] -[1] = -62 
The S.S. are obtained as follows. 


S^ p = S.S. due to NP = 


r~r(NP ] 2 = 529; 
16 
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Sl K = S.S. due to NK = -^[NK] 2 =20.25; 
S 2 pk = S.S. due to NPK = T [N PK] 2 =240.25 


Treatment S.S. = + Sp + + s np + s 


NK 


+ Spf< + S 


NPK 


= 1932.7501 


ANOVA 


w m « 

Source of 

Variation 

d.f. 

Sum of 

Squares 

M.S. 

Variance 

Ratio ‘F’ 

Block 

5 

2506 

501 

1.31 

T reatments 

7 

1932.75 

276.107 

- 

N 

1 

96.00 

96.00 

- 

P 

1 

1040.16 

1040.16 

2.71 

NP 

1 

529.00 

529.00 

1.3 

K 

1 

4.41 

4.41 

- 

NK 

1 

20.25 

20.25 

- 

PK 

1 

2.66 

2.66 

- 

NPK 

1 

240.25 

240.25 

- 

Error 

11 

4219.24 

383.57 

- 

Total 

23 

8658 




indicates that these ratios are less than one and hence these 
effects are non-significant. 


From the above table it is seen that effects due to blocks, main 
effects due to factor N, P, and K and interactions are not significant. 

Given a set of interactions confounded, how to obtain the blocks? 

The blocks of the design pertaining to the confounded interaction 
can be obtained by solving the equations obtained from confounded 
interaction. 

Example: 

Construct a 2 factorial in 2 3 plots per block confounding 
interactions ABD, ACE and BCDE. 

Let x 1( x 2 , x 3 , x 4 and x 5 denote the levels (0 or 1) of each of the 5 
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factors A,B,C,D and E. Solving the following equations (mod 2) would 
result in different blocks of the design. 


For interaction ABD: x } + x 2 + x 4 = 0,1 

For interaction ACE: x^ + x 3 + x 5 = 0, 1 

BCDE is the interaction that gets confounded automatically and it 
is called as the generalized interaction. Treatment combinations 
satisfying the following solutions of above equations will generate the 
required four blocks 


(0,0) (0,1) (1,0) (1,1) 

The solution (0, 0) will give the key block (A key block is one that 
contains one of the treatment combination of factors, each at lower 
level). 


There will be 



blocks per replicate. The key 


block is as 


obtained below: 
ABC 
1 1 1 

1 1 0 

1 0 1 

1 0 0 

0 1 1 

-0 1 0 

0 0 1 

0 0 0 


D 

0 

0 

1 

1 

1 

1 

0 

0 


E 

0 

1 

0 

1 

1 

0 

1 

0 


abc 

abe 

acd 

ade 

bcde 

bd 

ce 

(1) 


Similarly we can write the other blocks by taking the solutions of 
above equations as (0,1 ) (1,0) and (1,1). 


Given a block, how to find the interactions confounded? 


The first step in detecting the interactions confounded in a 
replicate is to select the key block. If key block is not given, it is not 
difficult to obtain it. Select any treatment combination given in the 
given block and multiply all the treatment combinations in the given 
block by that treatment combination and we get the key block. From 
the key block we know the number of factors as well as the block 
size. Let it be n and k. We know then that the given design belongs to 
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the 2 n factorial in 2 r plots per block. The next step is to search out a 
unit matrix of order r. From these we can find the interaction 
confounded. 


Example 

Given the following block, find out the interactions confounded. 

acde ad 

bed bde 

e ab 

abec c 

Since the given block is not the key block. Let us multiply it 
by e. We get the following block, 
acd ade 

bede bd 

(1) abe 

abc ce 


This is the key block as it includes (1). It is obvious that the 

3 

factorial involves five factors and has been confounded in 2 (=8) 
plots per block, Hence, the given design is (2 5 , 2 3 ). 


* 

★ 


★ 


A B • C 
1 0 1 
0 1 1 
0 0 0 
1 1 1 
1 0 0 
0 1 o 
1 1 0 
0 0 1 


D 

1 

1 

0 

0 

1 

1 

0 

0 


E 

0 

1 

0 

0 

1 

0 

1 

1 


* indicates the rows of a unit matrix of order 3. 


ABC 
1 0 0 
0 1 0 
0 0 1 


D E 

IHaO 1(=pi) 

1 (=(a 2 ) 0(=(3 2 ) 

0(=(a 3 ) 1 (=p 3 ) 


The interaction confounded are A^B^C^D, A^ l B^ 2 C^ 3 E. 
Here ABD and ACE are independent interactions confounded and 
BCDE is obtained as the product, of these two and is known as 
generalized interaction. 


Confounding _ _ 

Confounding in 3 n Series 

The concept of confounding here also is the same as in 2 
series. We shall illustrate the principles of confounding in 3 n in 3 
plots per block with the help of 3 3 experiments laid out in blocks of 
size 3 (=9). Let the three factors be A, B and C and the confounded 
interaction be ABC 2 . The three levels of each of the factor be 
denoted by 0, 1 and 2 and a particular treatment combination be 

XjXjXk, i, j, k = 0, 1,2. 

Number of blocks per replication =3 n r = 3 

Block size = 3 r = 9 

Degrees of freedom confounded =2 


Number of interactions confounded per replicate 


3 n ~ r -1 
3-1 


= 1 


The number of treatments in 3 blocks are determined by solving 
the following equations mod (3) 

x 1 +x 2 +2x 3 =0 

x 1 +x 2 +2x 3 =1 

Xt+x 2 +2x 3 =2 
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Balanced Design; 

A partially confounded design is said to be balanced if all the 
interaction of particular order are confounded in equal number of 
replication. 

How to construct a Balanced Factorial Design 

Example: Construct a (2 5 , 2 3 ) balanced design achieving 
balance over three and four factor interactions. 

Solution: Total no. of treatment combination = 2 5 = 32 

Number of blocks per replicate= 2 5 ' 3 =4 

The number of 3 factor interactions = ( 5 C 3 ) = 10 

The number of four factor interactions = ( 5 C 4 ) = 5 

So the total degrees of freedom to be confounded = 10+5 =15 

Since the design is (2 5 , 2 3 ), the degrees of freedom that can be 
confounded per replicate = 2 5 ' 3 -1=3. 

So the number of replicates required = 15/3 = 5 

Since the total number of three factor interactions is 10 and we 
have five replicates so each block confounds two, three factor 
interactions such that generalized interaction is of four factor. 

Three factor interactions are ABC, ABD, ABE, ACD, ACE, ADE, 
BCD, BCE, BDE, CDE. Four factor interactions are ABCD, ABCE, 
ABDE, ACDE, BCDE. 

Confounding ABD and ACE using equations 
Xi + X 2 + X 4 = 0, 1 
Xi + X 3 + X5 = 0, 1 

would result in replication I consisting of four blocks. Here only 
the key block each of the replication is given. Similarly Replication II 
confounds ACD, BCE a ABDE, Replication III confounds BCD, ABCE 
and ADE, Replication IV confounds ABCD, ABE and CDE, 
Replication V confounds ABC, BDE and ACDE. 
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Replication 1 




Replication II 



A 

B 

C 

D 

E 


A 

B 

c 

D 

E 


1 

0 

0 

1 

1 

ade 

1 

0 

0 

1 

0 

ad 

0 

1 

0 

1 

0 

bd 

0 

1 

0 

0 

1 

be 

0 

0 

1 

0 

1 

ce 

0 

0 

1 

1 

1 

cde 

1 

1 

0 

0 

1 

abe 

1 

1 

0 

1 

0 

abd 

1 

0 

1 

1 

0 

acd 

1 

0 

t 

0 

1 

ace 

0 

1 

1 

1 

1 

bcde 

0 

1 

1 

1 

0 

bed 

1 

1 

1 

0 

0 

abc 

1 

1 

1 

0 

0 

abc 

0 

0 

0 

0 

0 

(1) 

0 

0 

0 

0 

0 

(1) 


Replication III 




Replication IV 



A 

B 

c 

D 

E 


A 

B 

c 

D 

E 


1 

0 

0 

0 

1 

ae 

1 

0 

0 

1 

1 

ade 

0 

1 

0 

1 

1 

bde 

0 

1 

0 

1 

1 

bde 

0 

0 

1 

1 

1 

cde 

0 

0 

1 

1 

0 

cd 

1 

1 

0 

1 

0 

abd 

1 

1 

0 

0 

0 

ab 

1 

0 

1 

1 

0 

acd 

1 

0 

1 

0 

1 

ace 

0 

1 

1 

0 

0 

be 

0 

1 

1 

0 

1 

bee 

1 

1 

1 

0 

1 

abce 

1 

1 

1 

1 

0 

abed 

0 

0 

0 

0 

0 

(1) 

0 

0 

0 

0 

0 

(1) 

Replication V 










A 

B 

c 

D 

E 








1 

0 

0 

1 

0 

ac 







0 

1 

0 

1 

1 

bee 







0 

0 

1 

0 

1 

de 







1 

1 

0 

0 

1 

abe 







1 

0 

1 

1 

1 

abce 







0 

1 

1 

1 

0 

bde 







1 

1 

1 

0 

0 

abd 







0 

0 

0 

0 

0 

(1) 
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Chapter -10 


Split and Strip Plot Design 



The most common procedure for experiments with factorial sets 
of treatments is to use a randomized block design and assign the 
treatment combinations at random to plots within the blocks. 
However the requirement that all combinations be assigned at 
random can lead to another difficulty for conducting the experiment. 
For example 


1. An agronomist might be interested in tillage requirements as one 
factor and nitrogen level as another. Because tillage treatments 
require machinery, they require larger plots, while nitrogen can 
be applied by hand and can be applied to smaller plots. 

2. Experiments involving planting date as a factor would be easier 
to conduct if all the plots to be planted at one time could be 
grouped together. 

These examples illustrate the type of treatments that require 
different sizes of plots for their efficient application. So it would be 
convenient to have an experimental design in which level of one 
factor could be applied to larger plots while the levels of another 

factor are applied to smaller plots. This is possible with split plot 
design. 

In the split plot design the level of one factor are assigned at 
random to larger plots and the larger plots are then divided into small 
plots within the large plots. The larger units are called whole plots or 
main plots, while the smaller units are called split plots or subplots. 

For example the main plot factor are three methods of tillage: Ti, 
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T 2 , and T 3 and subplots factor might be four barley varieties: V 1t V 2 , 


V 3 and V 4 - The field plan for two replications will be 


Rep-I 



Rep - II 

| 

t 2 

mm 



Ti 

t 3 

t 2 

HHI 

V 4 

mm 



Vi 

mm 

■EH 

Vi 

V, 

mm 



WBm 

Vi 

■■ 

Ki 

KI 

mm 


■Hi 

— 

mm 

Vi 



mm 


■II 

mm 

■HI 

v 2 


Here the precision for the measurement of the effects of the 

main plot factor is sacrificed to improve that of subplot factor. Main 

plot treatment is estimated with low precision while subplot treatment 

and the interaction are estimated with higher precision. 

Advantages 

1. It permits the efficient use of some factors that require different 
sizes of plots for their applications. 

2. It permits the introduction of new treatments into an experiment 
that is already in progress. 

3. It provides increased precision in the estimation of some of the 
factorial effects. 

Disadvantages 

1. The whole plot factor is estimated with less precision than the 
sub plot. Hence large differences are often required for 
significance with the whole plot factor, while the subplot factor 
might have significant effects that are too small to be of practical 
significance. 

2. Statistical analysis is more complex than that required for the 
RBD because different standard errors are required for different 
comparisons. 

Model 

Let A be the main treatment with p levels, a 1f a 2 .a p and B be > 
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sub treatment with q level, bi, b 2 .b (| . Let r be the number of 

replications. 

Further, let Y ijk denote the observation of ajb k treatment 
combination in i th replicate. Then the model is 

Y ijk = p + n + a, + ejj + b k + (ab) jk + e ijk 

i = 1,2,.r j = 1,2,.... p, k = 1, 2,.... q 

where p is the general mean, r„ a v and b k are the effects of i 
replication, j th level of A, k th level of B and (ab)j k is the interaction 
effect, e^ and ejj k are the random errors distributed independently and 
normally with mean o and variances o\ and respectively i.e. e,j ~ 

N(0, ol) and e ijk ~ N(0, ol) 

Method of Analysis 

Let A be the main treatment with a levels as a^ a 2 .. a p and 

B be sub treatment with q levels as bi, b 2 .b q . Let r be the 

number of replications. The data are entered in the following date 
sheet. From the marginal totals and sub totals prepare R x A and A x 
B tables. 


Replicate 

T reatment 

1 

2 

i . 

.r 

Totals 

combination 





aib! 





aibi 

aib 2 





aib 2 

• • 

aib q 





a f b q 

Sub total 

na! 

*2 ^ n aj 

r r a. 


a 2 bi 





a 2 bi 

a 2 b 2 





a 2 b 2 

a 2 b q 





a 2 b g 
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Sub total 


R x A table 


a 


Totals 


na 2 r 2 a 2 




r ' a R r 2 a p ^a. 




A x B Table 



Totals 


a 2 

a p 


a P bi 



Let R 1? R 2 .R r be totals of r replicates 

Ai, A 2 .A p be totals of a levels of factor A 

Bi, B 2 .B q be totals of b levels of factor B 

Then N = rpq, G = IIIY ijk = XRi = IAj = IB k = Grand Total 
C.F. = G 2 /N 
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ANOVA Table 


Source 



Error(a) 



d.f. 

S.S. 

M.S. 

r-1 

IR, 2 /pq-C.F. = SSR 

MSR 

p-i 

IAi 2 /rq-C.F.=SSA 

MSA 


Ea 

q-1 

IB k */rp-C.F. =SSB 

MSB 

ESDI 

BsSni 

II(a i b k ) 1 7r-C.F.-SSA- 

SSB=SSAB 

MSAB 

tsssm 

IgSh 

By subtraction = SSE b 

MSE b 

bsbh 

zy 2 -c.f. 




MSA/E, 



: b 


'E b 


Main 

Sub 


Error (b) 


Total 


The main treatment A is tested against error (a) while sub 
treatment and the interaction A x B are tested against Error (b). 



Standard Errors and Critical Differences 


The standard errors for the comparison of the different treatment 
means are presented in the form of the table. 


Difference 

between 

Notation 

<0 

m 

Q. 

C.D. at a 
level of 

significance 

Two A 

mean 

a,-a. 

V 2E a /rq 

SE( d) X t[ a , ( r .-|) 

(P-1)] 

Two B 

mean 

bi-bi 

V 2E b /r q 

SE( d ) X t[ot, p(r-l) 

(r-1)] 

Two B 

means at 
the same 
level A 


V2E b /i- 

SE (d ) X t[ a , p(r-l) 
(r-1)] 

Two B 

means at 
the same or 
different 
level of A 


V2[(q-1)E b +E a ]/rq 

SE( d ) X t[ u , p(r-l) 

(r-1)] 
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In the case of treatment means of two main treatments at the 
same level of sub-treatments or different levels of sub-treatment, the 
ratio of difference between two treatment means and its SE does not 
follow students t-distribution. Let L and t 2 be tabulated value of t 
corresponding to the degree of freedom for main plot error and sub 
plot error, then the tabulated value of t for the above said comparison 
becomes. 

t _ (q-1)E b t 2 +E fl t 1 
(q-1)E b +E a 

Presentation of results: 


A x B mean table (which is obtained by dividing A x B total table 
contents by r and taking means on both sides) along with various 
standard errors and C.D. values can be presented as follows: 


A x B Mean Table 


Si 

a 2 

a P 

Means 

bi 

a-ib^r 

a 2 bi/ r 

apbi/r 

bi 

b 2 

a-)b 2 /r 

cb^/r 

a p b 2 /r 


- 

• 


bq 

a^q/r 

a 2 b q /r . 

a p b q /r 

bq 

Means 

ai 

a 2 

a p 

G 


SE d for A = B = A x B = 

C.D. for A =; B= ; Ax B = 


Spilt-split plot design 

The sub plots in spilt plot design can further be divided into sub 
plots to accommodate one more factor. For example, irrigation levels 
will be taken as main treatments, levels of nitrogen as sub-treatment 
and methods of application of nitrogen as sub sub-treatment. The 
method of analysis of this design will have three error components 
Instead of two. Let the number of replications is ‘r’ main plot 
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treatments ‘p’ sub plot treatment ‘q’ and sub sub-treatment ‘s’. It is a 
simple extension of split plot design and hence sub-sub treatments 
are more precisely compared than sub treatments which are in turn 
more precisely compared than main treatments. The different sum of 
squares is computed on the basis of sub-sub unit. The divisor at 
each stage will be the number of sub-sub units involved in the 
numerator. 


ANOVA Table 


Source 

d.f. 

SS 

M.S. 

Fcal 

Replication 

(r-1) 

SSR 

MSR 

MSR/MSEi~F(r-1). (r-l)(p-1) 

A 

(P-1) 

SSA 

MSA 

MSA/MSEi~F(p-i), (r-i) (p-t) 

Main plot error 

(Ei) 

(r-D(p-l) 

SSEi 

MSEi 


Sub plot 

treatment B 

(q-i) 

SSB 

MSB 

MSB/MSE2~F(q-l). p(q-1) (r-1) 

Interaction AB 

(p-i) (q-i) 

SSAB 

MSAB 

MSAB/MSE 2 -F(p-1) (q-i). p(q-l) (r-1) 

Sub plot error 

(E 2 ) 

p(q-1)(r-1) 

SSE 2 

MSE 2 


Sub Sub 

treatment C 

(s-1) 

SSC 

MSC ! 

MSC/MSE 3 ~F(S-1), pq(r-l) (s-1) 

Interaction AC 

| 

SSAC 

MS/AC 

MSAC/MSE 3 ~F(P- 1 ) (S-1), pq(r-D (s-1) 

AB 


SSAB 


MSAB/MSE 3 -F(q-l) (s-1), pq(r-l) (s-1) 

ABC 

(p-1)(q-1)(s-1) 

SSABC 

MSABC 

MSABC/MSE 3 ~F(p- 1) (q-1) (s-1),pq(r-1) (s-1) 

Sub-sub plot 

error (E 3 ) 

pq(r-l) (s-1) 

sse 3 

MSE 3 



Total 

rpqs-1 






The standard error for the difference of two treatments means 
are same for the main unit and sub plot treatments as the case of 
split plot design except the extra divisor y[s~. The standard errors for 
remaining comparison are given below. 
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Standard Errors 


Difference 

between 


Two C 
Means 


Two C 
means at 
the same 
level of A 


Notation 


Ci-Cj 


SEc 


a i C j ' a i c k 


^MSEj/rpq 


^2MSE 3 /rq 


Two C 
means at 
the same 
level of B 


b i c i -bjC k 


^MSEj/rp 


Two C 
means at 
the same 
evel of AB 


a ibj c k -a i b|C l 


3 /2MSE 3 /r 


Two B 
means at 
he same 
or different 
level of C 


b|C k -b,c k or 


3 /2[(s-1)MSE 3 +MSE 2 ]/rps 


b iC k bjC e 


Two BC 
means at 
the same 
level of A 


a jbjC j — aibjC k 


^2[(s -1) MSE 3 + MSE 1 ]/rs 


Two A 

means at 
same or 

different 
level of C 


ajCj -ajCj or 


> /2[(s-1)MSE 3 n-MSE^/rsq 


3jCj ajC k 


Two A 
means at 
the same 
level of B 
and C 


a ib k C| -a|b k C| 


,/ 2 [(s -1)MSE 3 +(q-1)MSE 2 +MSE,]/rsq 


Missing Observation in Split Plot Design 

Let a single sub unit in which the treatments ‘afo’ occurs is 
missing in split plot layout, then the estimate of the missing 

observation is 
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_ rA i +q(a i b j )-(a i ) 
j (r -1 )(q -1) 

where Aj is the total of the whole or main unit in which missing 
sub unit occurs, (ajbj) be the total of all the remaining sub units which 
receive the sub treatment ajbj and fo) be the total of all the sub units 
which receive the whole unit, a t over all replications. If one sub unit is 
missing one d.f. is subtracted from errors of sub plot and also from 
the total. The treatment sum of squares and error for main plots are 
slightly inflated due to substitution of missing value. 

Experiments involving more than two factors 

When number of factors are to be tested is more than two then 
either split split plot design may be used or two or more factors can 
be combined in main treatments or in sub treatments or both in main 
x treatments. 


Illustration 


Give layout, analyse and precision of various factors and their 
interactions of a split plot design with 3 replication and having 
varieties = 2 (Vi, V 2 ), fertilizers = 3 (F 1? F 2 , F 3 ); Pesticides = 2 (P^ P 2 ) 
when (i) fertilizers are taken as main plot and varieties and pesticides 
both are taken as subplot treatment (ii) fertilizers and pesticides are 
both taken as main treatments and varieties are taken as sub 
treatment. 


Solution 


Case I: Fertilizer are taken as main treatment while varieties and 
pesticides both are taken as sub treatments. The layout is 


Rl 


Rll 


Rill 

wm 

mm 

F 3 


mm 

f 3 

f 2 


f 3 

mm 

mm 

EK3I 

V,p, 

ESS 


ViP, 

V 2 Pi 

DO 


ESI 

ViPi 

V 2 Pi 

v,p 2 

BIS 

V,p 2 


BIS 

Vi Pi 

DEI 


Vi Pi 

< 

T> 

N> 

B83 

E2QI 

ViP 2 

Vi Pi 


CM 

CL 

> 

TEM 

ESI 


ViP 2 

r- 

CL 

> 

ESI 


V 2 Pi 

CL 

CM 

> 


CL 

CM 

> 

ViP 2 

Vi Pi 


V 2 Pr 

HE 

DO 
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From the treatments totals prepare the various interaction tables 
like P x F, F x V and V x P as explained in the procedure earlier 
and find totals for replicates, fertilizers, varieties and pesticides. Let 
Ri. R 2 , R 3 ; Fi, F 2) F 3 ; V-,, V 2 , V 3 and P 1t P 2 be replicate totals, fertilizer 
totals, varieties totals and pesticide totals respectively. 


ANOVA Table 


Source 

d.f. 

ss 

M.S. 

Fcal 

Replicates 

r-1 =2 

^Rf/ 12 -c.F. 

MSR 

MSR/ MSEa 

Main 

treatments 





(Fertilizers) 

IBB 

^Ff/12-C.F. 

MSF 

MSF/MSEa 

Error (a) 

(r-1) (f-1 )=4 


MSEa 


. O.r. - OOI\ — oOl 

4 

1 Sub 

treatments 





(varieties) 

(v-1) = 1 

2X /i8 - c - F - 

MSV 

MSV/MSEb 

Pesticides 

P -1 = 1 

^Pi 2 /18-C.F. 

MSP 

MSP/MSEb 

Var. x 

pesticides 

(v-1)(p-1)=1 

7,») 2 

MSVP 

MSVp/MSEb 

o.r. - OOV — oor 

9 

Main x sub: 





Fed. x 

varieties 

(f-1) (v-1 )=2 

yy (W ! 

^4-f j p p cop pew 

MSFV 

MSFV/MSEb 

o.r, • oor — oov 

9 

Fert. x 

pesticides 

(M) (P-1) 

=2 

yj/fjPi) 2 

MSFP 

MSFP/MSEb 

v^.r. oor oor 

6 

Fert. x 

varieties x 

pesticides 

(f-1)(v-1) 

(P-1)=2 

El , CF ssv 

MSFVP 

MSFVP/MSEb 

v/.r. OOV — 

3 

SSF - SSP - SSFP - SSVF - SSVP 

■■I 

t)_ 

By subtraction 

MSEb 


Total 


IY 2 - C.F. 




Here the main treatment (fertilizer) is estimated with low 


precision while both the sub treatments (varieties and pesticides) and 
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all the interaction are estimated with higher precision. 


Presentation of results: Prepare the following mean tables in 
cyclic order from tables of totals and take means, give the standard 
error of mean or difference and CD values._ 


PxF 


F x V 


V 

--—— 


■ 


m 

f 2 

f 3 








El 

Pi 





F 2 












f 2 







p 2 ^ 





f 3 h 





■ 

■ 


n 



■ 









Case-11 : When combinations of fertilizer and pesticides are taken 
as main plot treatments and varieties as sub-treatments. 


Layout 


1 

1 

J3 

-*• 1 

1 


F 2 Pi 

f 3 p, 

f 2 p 2 

F 3 Pi 

□T 

LL 

CM 

> 

Vi 

CM 

> 

v 1 

v 2 

Vi 

Vi 

CM 

> 


CM 

> 

Vi 

V 2 


r 2 

F1P1 

CM 

0 L 

CM 

Li_ 

F 3 Pi 

FiP 2 

f 3 p 2 

F 2 Pi 

v 2 

Vi 

V 2 

Vi 

V 2 

Vi 

Vi 

V 2 

Vi 

V 2 ^ 

Vi 

V 2 


r 3 

f 3 p 2 

f 2 p 2 

f 2 p 4 

F1P1 

FiP 2 

F 3 Pi 

Vi 

V 2 

Vi 

v 2 

V 2 

V 2 

V 2 

Vi 

V 2 

Vi 

Vi n 

Vi 


Prepare P x F, F x V and V x P tables of totals as explained in 
the previous case and find totals for various levels of fertilizer, 
pesticide and varieties. Let R 1t R 2 , R 3 ; F 2 , F 3 ; Vi, V 2 , V 3 and Pi, P2 b e 
replicate totals, fertilizer totals, varieties totals and pesticides totals 
respectively. 
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Split and Strip Plot Design 
ANOVA Table 


S 


Main treatment 


d.f. 

S.S. 

r-1=2 

LRi 2 /12 -C.F. 


ze 


Pesticides 


Fell. 

pesticides 


Error(a) 


f -1 =2 


ZFi 2 /12-C.F. 


= 2 ZPj 2 /18-C.F. 


X (f-1) (p- life) 2 _ 


1)=2 


(M) 

(fP'1)=1 


-C.F.-SSF-SSP 




-C.F.-SSR - SSF 


- SSP-SSFP 



Sub-treatment 


Varieties 


Main x sub 


Fert. x 
varieties 


Fert. x 
pesticides 




(f-1) (v-1) =2 

life 2 

1 C.F. SSF SSV 

6 

(f-1) (P-1) 

= 2 

IZ(fiPi) 2 

11 C.F. SSF SSP 

6 

-SSFP 

(f-i) (p-i) 

(v-1) =2 

II(fjPjV k ) 2 

|lk C.F. SSF SSP 

6 

- SS FP - SSV - SSFV - SSV P 


V/Eb 


FV/Eb 



E 


Total 


LY 2 -C.F. 


Here both the main treatments fertilizers and pesticides and their 
interaction FxP are estimated with low precision, while the sub 
treatment varieties and all other interaction of various factors with 
varieties are estimated with higher precision. The results are 
presented in the same way as in the previous case. 


Example: 

A wheat breeder wanted to determine the effect of planting date 
on the yield of four varieties of winter wheat. 
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Experimental Designing^ and Data Analys is 

Planting date: Di = Oct. 10; D 2 = Oct. 20; D 3 = Oct. 30 

Varieties are: V 2 ; V 3 ; V 4 

To facilitate the operation of planting and harvesting a split-plot 
design with planting dates assigned at random to main plots within 
each of three blocks. Varieties were then assigned at random to plots 
with planting dates. The field plan and yields (kg/plot) are given 
below. Analyse the data and describe results. 


Blocks I II III 



Data Analysis 

The preliminary steps in the data analysis involve the 
construction of two tables of totals. These are shown as table 1 and 
2. The analysis of variance of the yield data is presented in table 3. 

c F = — = ( 851 ) j - 20,116.69 
rpq 36 

Total SS (SSTot) = ZISyf k -CF = 26 2 +.+ 28 2 -20,116.69 

i j k 

= 1596.31 

SS due to replication/ blocks (SSR) 

= J-£R 2 -CF = — (273 2 + 299 2 +279 2 )-C.F. =30.89 
pq , ' 12 
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Table-1: Two-way table of Block x Date totals 


Block 

Date 

i 

II 

II 

Sum 

Di 

97 

97 

87 

281 

d 2 

104 

106 

115 

32S 

mi 

72 

96 1 

77 

245 


273 

299 

279 

851 


Table-2: Two-way table of Variety x Date (totals) 


Variety 

Date 


v 2 

v 3 

v 4 

Sum 

Di 

92 

65 

76 

48 

281 

d 2 

112 

81 

66 

66 

325 

d 3 

69 

62 

62 

52 

245 

Sum 

273 

208 

204 

166 

851 


SS due to main plots (SSA) 

=—I A? -CF = — (281 2 +325 2 + 245 2 )-20116.69 =267.55 

rq Y 12 


SS due to error A (SSEA) = — ££R 2 -CF- SSR - SSA 

^ i j 

= — (97 2 + ... + 77 2 )- 20116.69 -30.89 - 267.55 =83.11 
4 

SS due to subplot (SSB) 

= r 2 CF = - (273 2 + 208 2 + 204 2 +116 2 ) - CF = 657.19 
ra k 9 

SS due to Interaction (SSAB) = -LIT£-CF- SSA - SSB 

r j k 

_ —(92 2 + ... + 52 2 )-CF-267.55-657.19 =204.89 
3 

SS due to error (b) = SSTot — SSR — SSA — SSEA — SSB-SSAB 
= 1589.31 - 30.89 - 267.55 - 83.11 - 657.19 - 204.89 


352.67 










































162 _ Experimental Designing and Data Analysis 

Analysis of variance of yields (kg/plot) from a Variety X Planting 
dates trial with Winter Wheat 


ANOVA 

Source 

df 

SS 

MS 

F 

Block 

2 

30.89 

15.44 

0.74 

Date 

2 

267.55 

133.78 

6.44 

Error(a) 

4 

83.11 

20.78 


Variety 

3 1 

657.19 

219.06 

11.18** 

Variety x date 

6 

204.89 

34.15 

T74 

Error (b) 

18 

352.67 

19.59 


Total 

35 

1596.306 

45.61 



** significant at 1%. 


Testing Significance 

1. Tabulated 1% F at 2, 4 df = 18.00 > 6.44. Thus the main 
effect of the planting date is non significant. 

2. Tabulated 1% F at 3,18 df = 5.09 < 11.18. Hence the main 
effect of the variety is highly significant. 

3. Tabulated 5% F at 6, 18 df = 2.66 and the 1% F at 6, 18 df = 
4.01. 

Since 2.66 > 1.74 and also 4.01 > 1.74, the date x variety 
interaction is not significant at the 5% level as well as at the 
1 % level. 


Standard errors: 
Means: 


1 . 


Planting date means: 


SE = 


/ ms r B 
V rq 


/20.78 
V 12 


1.31 


2 . 


Variety means: 



[19.59 


1.47 
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0 4 ^ MSE b 19.59 occ 

3. Date x variety means: SE = J—^ = J—-— = 2.55 


Differences: 

1. Two date means: SE H = = 


^(2)(20.78) 

12 


= 1.86 


T or . |2MSE b /(2)(19.59) 

Two variety means: SE d = /——5- = --- 


Two variety means on 
SE d = = /iS = 3.61 


Two date means for same or different variety: 
SEd= f2[(q-1)MSE b +MSE a ] 

d v r q 


same 


'2[(3)(19.59) + 20.78] 


= 3.64 


= 2.08 


date: 


Further CD value can be calculated 

The results of an experiment to examine the effect of planting 
date on the yield of four varieties of winter wheat are summarized in 
Table 4. 


Table 4: Mean yields (kg/plot) of four winter wheat varieties 
planted on three different dates 

___ Variety __ 

Date _ 1 _2 3 1 4 ^ Mean 


Oct. 10 

30.67 

21.67 

23.33 

16.00 

23.42 

Oct. 20 

37.33 

27.00 

22.00 

22.00 

27.08 

Oct. 30 

23.00 

20.67 

20.67 

17.33 

20.42 

Mean 

30.33 

23.11 

22.67 

18.44 

23.64 


SE: date = 1.31, variety = 1.47; variety x date = 2.55 


Strip Plot Design 

Consider an experiment with methods of seed-bed preparation at 
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three levels: S 1t S 2 and S 3 , and a nitrogen fertilizer factor of four 
levels: N 1t N 2 , N 3 and N 4 . Suppose that we wanted to use machinery 
to apply the nitrogen as well as to do the seed-bed preparation. In 
this case it would be convenient to use large plots for both factors. 
This is possible with the strip-plot design. 

In a strip-plot design the levels of one factor are assigned to 
strips of plots running through the block in one direction. A separate 
randomization is used in each block. The levels of the second factor 
are then applied to strips of plots that are oriented perpendicularly to 
the strips for the first factor. The field plan for two replications of a 
seed bed x nitrogen experiment run as a strip-plot design might 
appear as shown below. 


Replication 1 

Replication II 


s 3 

Si 

S 2 


B 

s 3 

s 2 

Ni 




n 2 




n 2 




n 3 




No 




Nl 




n 3 




N 0 





Field plan for strip-plot design for a seed bed x nitrogen 
experiment 


In the strip plot design there are three sizes of plots: 

1. The strip of four plots on which the seed-bed factor is 
applied. 

2 . The strip of three plots on which the levels of nitrogen are 
applied. 

3. The small plots containing the NS combinations. 

Because there are three size of plot, there are three experimental 
errors, one for each plot size. The interaction is measured with 
greater precision than are the main effects. 
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Split and Strip Plot Design _ ‘ 

Advantages, Disadvantages, and Uses: 

It permits the efficient application of factors that would be difficult 
to apply to small plots. Here both the factors are estimated with low 
precision while the interaction A x B is estimated with higher 
precision. 

The disadvantage of the strip-plot design are similar to those of 
the split-plot design: 

• Differential precision in the estimation of interaction and the main 
effects. 

• Complicated statistical analysis. 

The use of the strip-plot design are similar to the uses of the 
split-plot design: 

• For experiments involving factors that are difficult to apply to 
small plots. 

• To introduce new factors into an experiment already in progress. 

Model 

Let Yp be the observations of the plot belonging to the i th 
replicate, j th levels of factor A and k lh level of second factor B i.e. to 
the treatment combination (ajb k ) in the i th replicate. 

Y,jk = p +n +aj + ejj + b k + ej k + (ab)j k + e^ 

i = 1.2.r, j = 1, 2.p, k = 1,2,.q. 

p = general mean, r, = i ,h block effects 

a-, = effect of j ,h level of factor A b k = effect of k lh level of factor B 

(ab) Jk = Interaction between j th level of factor A and k th level of 
factor B 

The error components e^, e ik and e l]k are independently and 
normally distributed with mean zero and respective variances o 2 a , a 2 b 
and o 2 c 

Prepare R x A, R x B and A x B tables of totals from the data and 
find totals of various replicates and various levels of factors A and B. 
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Total I Rt R 2 R r Total FT R 2 R r Total Ai A 2 


Let Ri R 2 .R r be the totals of r replicates 

Ai A 2 .A p be the totals of p levels of factor A 

B 2 ., B q be the totals of q levels of factor B 

Then N = rpq = Total No. of observations 
IIYijk = IRj - LA, = IB k = G = grand total 


C.F. = G/N 


ANOVA Table 


Source 

d.f. 

s.s. 

M.S. 

F 

Replicates 

r-1 

IR^/pq-C.F. 

R 


Factor A 

P-1 

LA/Vrq-C.F. 

A 

A/E a 

Error (a) 

(r-1) (p-1) 

EE(ri a^/q- C.F. 

SSR-SSA 

E a 


Factor B 

q-i 

ZB k ^/rp-C.F. 

B 

B/E b 

Error (b) 

(r-1) (q-1) 

■ 

"^(r, b k )"/p - 1 
C.F. 

SSR-SSB 

E b 


Interaction A x B 

(p-1) (q-i) 

II(a, b k ) 2 /r-C.F. 

SSA-SSB 

AB 


Error (c) 

(-) 

by subtraction 

E c 


Total 

rpq-1 

ZIIyij^-C.F. 




Standard error of difference and CD values for various 
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comparisons is obtained as follows: 
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Comparison 

S.E. of difference 

5% C.D. 

Between two A 
means say a 1 - a 2 

V 2E a/rq 

V 2 E a /rq xt l( „.(r.i)(p.,)i 

Between two B 
means say - b 2 

. 

Q. 

k_ 

-O 

LU 

^2E b /rp X t [(oi> (r- 1 ) (q- 1 )] 

Between two A 

means at same 
level of B say 
a^ -a 2 b 1 

V2[(q-1)E C fEJ/rq 

SE d x t w 

Between two B 

means at same 

level of A say 

aibi - a-jb 2 

^[(p-IJEc +E b ]/rp 

SE d x t p 


(q-i)E c t c +E a t 
(q-1 )E c +E 


where t a and t c are the t values at error (a) errors (c) degree of 
freedom. 

(p — 1 )E C t c +E b t b 
P (p-1)E c +E b 


where t b is t values at error (b) degree of treatments. 


Remarks: 

Like wise in Split plot design, in Strip plot design too, when the 
number of factors to be tested is more than two then two or more 
factors can be clubbed in factor A or in factor B or in both the f actors 
A and B as per the situation. 

Example 

With a view to formulate optimum spacing schedule for different 
varieties of wheat an experiment was conducted in a strip plot 
design. 






168 


Experimental^ Designing and Data Analysis 


The treatment were: 


Varieties V,, V 2) V 3t V 4f V 5 Spacings S!, S 2) S 3l S 4 

Numbers of replications = 3 

The layout and the yield in kg/plot is given below: 



REPLICATION 1 


1 

REPLICATION II 


s, 

s 2 

s 3 

s 4 


s 3 

Si 

s 4 

s 2 

v 2 

5.6 

2.3 

6.7 

4.93. 

v 4 

3.5 

6.45 

4.8 

6.9 

v 5 

5.46 

5.87 

2.63 

6.78 

Vi 

6.5 

4.69 

1.59 

4.96 

v 3 

2.24 

5.67 

3.48 

6.58 

V 5 

5.32 

6.89 

2.45 

5.36 

Vi 

5.67 

6.89 

2.56 

3.78 

V 2 

4.25 

3.45 

5.69 

4.62 

. v 4 

2.60 

5.65 

3.26 

2.57 

V 3 

2.86 

4.39 

4.68 

2.9 



REPLICATION III 


B 

S2 

s 3 

B 

v 3 

4.26 

6.89 

4.35 

2.89 

v 4 

5.69 

4.89 

4.58 

5.36 

m 

2.56 

6.89 

3.25 

4.60 

V 2 

8.90 

2.68 

4.89 

6.09 

V! 

3.89 

2.68 

1.89 

2.80 


Analysis 


We form two-way tables of var x Repl. spacing x Repl and variety 
x spacing 


Replication 

s 



1 

2 

3 

Total 

| 


1 

18.90 

17.74 

11.26 

mm 

warn 


2 

19.53 

18.01 

22.56 

60.10 

|RS 


3 

17.97 


18.39 

51.19 



4 

14.08 

21.65 

20.52 

56.25 

BOB 


5 

20.74 

20.02 

17.30 

58.06 



Total 

91.22 

92.25 

90.03 

273.50 
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Replications 


1 

1 

2 

3 

Total 

Mean 


1 

21.57 

25.87 

25.3 

72.74 

24.25 


2 

26.38 

24.74 

24.03 

75.15 

25.05 

Spacing. 

3 

18.63 

22.43 

18.96 

60.02 

20.01 


4 

24.64 

19.21 

21.74 

65.59 

21.86 


Total 

91.22 

92.25 

90.03 

273.5 



Spacinqs 





1 

2 

3 

4 

Total 


1 

14.25 

14.53 

10.95 

8.15 

47.9 


2 

17.95 

9.60 

15.84 

16.71 

60.10 

Variety 

3 

10.89 

15.46 

10.69 

14.15 

51.19 


4 

14.74 

17.44 

11.34 

12.73 

56.25 


5 

14.91 

18.12 

11.20 

13.83 

58.06 


Total 

72.74 

75.15 

60.02 

65.59 

273.50 


C.F= (GT > 2 - (273:5) 2 

pq 


Variety SS (VSS) = 

= 8.45 

Replication SS = 


60 

_2X 


= 1246.70 


rq 


- C.F. = 


47.9 2 +.+ 58.06 2 

12 


-C.F. 




pq 


--C.F. =0.123 


Total SS (1) = 


_ ViR^-f.+ V s Rj 


-C.F. 


18.9 2 +.+ 17.3 2 


C.F. =31.48 


SS due to error E, = Total SS (1) - Replication SS - Var SS = 22.91 

_ X s * 


Spacing SS = 


rp 


k __ c p = 7 2.74 +.+ 65.59- 


15 


-C.F. =9.50 
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Total SS ( 2 ) = 


Experimental^ Designing and Data Ana lysis 

_ SiRf+ S 5 R§ 

5 


-C.F. 


SS due to error E 2 = Total SS(2)-Spacing SS - Replication SS = 7.36 

_ SiVi 2 +S 2 V| +. + 


Total SS ( 3 ) = 


-C.F. 


_ 14.25 2 +.+ 13.83 2 


-C.F. =52.25 


SS due to interaction, VS = Total SS( 3 ) - Varieties SS - Spacing 
SS = 34.29 


Total sum of squares = III Y ijk 2 - C.F. = 158.83 
Error sum of squares (E 3 ) 

= Total - Replication SS - Variety SS - Error SS (E^ - Spacing 


SS - Error SS (E 2 ) - (VxS) SS = 76.19 


ANOVA 

Source 

d.f 

SS 

MSS 

F 

Replication 

2 

0.12 

0.06 

0.02 

Variety, V 

4 

8.45 

2.11 

0.74 

Error E, 

- ! - 

8 

22.91 

2.86 


Spacing, S 

3 

9.50 

3.17 

2.58 

Error, E 2 

6 

7.37 

1.23 


Variety x Spacing, VxS 

12 

34.29 

2.86 

0.90 

Error, E 3 

24 

76.19 

3.17 


Total 

59 

158.83 




SE of difference between two variety means 


SE(d) 


2 MSE 1 
V rq - 


0.690 


SE of difference between two spacing means = p^SE 2 _ 0 4045 

V P r 

Standard error of difference between two spacing means at 
same level of variety 



























SE of difference between two varieties at the same level of 
spacing 



□ □ □ 



Chapter -11 


Covariance Analysis 



It often happens that some source of variation which can not be 
controlled by the design can be measured by taking additional 
observations on some characters, called concomitant variable (x) 
which is highly correlated with the character of interest (y). Consider 
the case of rice variety trial in which weed incidence is used as a 
concomitant variable or covariate with grain yield, the character of 
primary interest. Similarly, in wheat crop, we take tiller number as the 


concomitant varable. The covariance analysis can adjust the mean 
yield in each plot to a common level of weed incidence. With the 
adjustment, the variations in yield due to weed incidence is quantified 
and effectively separated from that due to varietal difference. 
Covariance analysis can be applied to any number of covariates and 
to any type of functional relationship between variables. Here we will 
take a case of single covariate whose relationship to the character of 
interest is linear. Analysis of covariance in fact is statistical technique 
and not a design through which the error variation could be reduced 
by increasing the efficiency of the design. 


Assumptions 

1) The effect of different factors i.e. treatments, groups and 
regression are additive. 

2) The yield is distributed normally and independently. 
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3) The covariate X or the concomitant variabte X is not affected by 
the treatments. 

4) The concomitant variable should be highly correlated with 
variable under study. 

Analysis of covariance of RBD data 

Let Yjj denote the yield of i ,h treatment in the j ,h replication, i = 1, 

2,.v and j = 1, 2,.r and X fi be the corresponding value 

of the variable X. Assuming that there is no interaction between 
treatments and plots. We have a 2-way classification with one pair of 
observations per cell. 

Let us assume that 
Yij = |i + t, + b| + p (Xy - X) + e H 
where 

Xjj = observation on concomitant variable 

P = general mean effect 

tj = effect of the i ,h treatment 

bj = effect of the j ,h block 

P = regression coefficient of Y and X 

ey = are random error components independently and 
normally distributed with mean zero and variance o 2 e 

Let 

S Y'i = ^ = Bjj ^ Xy = T ix ; ^ X^ = B jx 

i ' i i 

Z T '=I B i=G; ZT ix =£B ix =G x ; 

' i ' j 

The analysis of covariance will be of the form 
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ANOCOVA 


Source 

df 

Sxx 

Blocks 

(r-1) 

Sxx = ^ B 2 x /k - G x /rk = B xx 
i 

Sxy = ^ BjBj X /k - G x G/rk = B xy 
j 

s YY = £Bj/k-G 2 /rk = B yy 

j 

Treatment 

(v - 1) 

Sxx =^T jx /r-G x /rk = T xx 

i 

Sxy= ^TjT ix /r - G x G/rk = T xy 

i 

Sw = X T'/r - G2 /rk = T yy 

i 

Error 

(r-1) (v-1) 

Sxx = By subtraction(=E xx ) 

Sxy = By subtraction (= E xy ) 

S Y y = By subtraction (= E vv ) 

Total 

(rv-1) 

S X x= SZXij 2 —|-=G„ 

Sxv=ZSX lj Y, = Gxy 

Syy=ZIV iJ 2 ^ =G yy 

Treatment + 

Error 

r(k-1) 

Sxx = T xx +E xx =E xx 

Sxy = T X yH"E X y=E X y 

SYY == Tyy+Eyy=E yy 


Our Hypothesis is H 0 : U = t 2 =.= t k 


Again adjusted Error Mean Square, E, with [(r-1)(v-1) - 1]d.f. 

= Eyy - PExy where p = E xy /E xx 
2 

= Byy ~ (^xy) 

Adjusted Error Mean Square E, (under H 0 ), with [(v-1)-1]d.f 
= E' w - (E\ y f/E' m 
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Adjusted Treatments ss = Ei - E with (v-l)d.f. 

Then the Hypothesis, H 0 is tested by F test, where 
p _ Adj usted Treatment Mean Square 
Adjusted Error Mean Square 

= W i,h (v -1), and [(r-1)( v-1)-1]d.f. 

If F ca , > F, ab with (v-1),[(r-1)(v-1)-1]d.f. at chosen level of 
significance, th null hypothesis is rejected otherwise not. 

The adjusted mean value for i th treatment 

Tj = Yj -p(Xj -X..) 

where Y,=^Y,/r. X,=£x,/r, X~ = ££x,/iv 

i i ' i 

The difference between two adjusted treatment means is given 
by 

T, — T,' = V, — Y' - P(X; - X') 

l 

V(T,-T I ') = — + |.-(X i -Xf ) 2 

So the variance of the estimate of the differences between any 
pair of treatment is not a constant. To make the critical difference a 
constant for each pair of treatment we have Average variance of 
treatment difference. 

1+— ^ 

(v-DEjj 

where Sg is the adjusted error mean square E. If Sg < E yy /(r-1 

(v-1), then we say that analysis covariance is useful. 

The effective error mean square in case of analysis of 
covariance si; is. 

t; 


— — ?Q 2 

V(T, - TiO = -—2- 
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s 


2 ' 
e 





(v-1)E xx 


1 

J 


Per cent gain in efficiency 
covariance as compared to simple 


-1 


x 100 


due to application of analysis of 
analysis is given by 


where s = 


yy 


(r -1) (v -1) 


Example 

An experiment was conducted to study the effect of level of 
protein in ration of cows on milk production. The three levels of 
protein are A, B and C. The design used was RBD with five 
replications. Here Y is the total milk yield for experimental period and 
X is the total milk yield for first 15 pre-experimental days after calving. 

Analyse the data by using information about X for increasing the 
precision and draw conclusions using analysis of covariance 
technique. 


Milk yield (in kg) of experimental animals 




A 


B 

C 

Blocks 

X 

Y 

X 

Y 

X 

Y 

1 

190 

2400 

175 

1880 

173 

2251 

2 

170 

1850 

188 

2250 

225 

2574 

3 

175 

2430 

195 

2455 

190 

2812 

4 

150 

1650 

140 

1858 

162 

1594 

5 

120 

2630 

199 

2776 

144 

2890 

Total 

805 

10960 

897 

11219 

894 

12121 

Mean 

161 

2192 

179.4 

2243.8 

178.8 

2424.2 
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Calculations 


Tix = 805 

Ti =10960 

T 2x = 897 

T 2 = 11219 

T 3x = 894 

T 3 = 12121 

B lx = 538 

B a = 6531 

B 2x = 583 

B 2 = 6674 

B 3x = 560 

B 3 = 7697 

B 4x = 452 

B 4 = 5102 

B 5x = 463 

B 5 = 8296 


G x = £T ix =IB jx =2596 

i 

G = ITj = IBj = 34300 

Various sum of squares can be calculated as follows: 

G 2 

Bxx = -Yb 2 -S- = 453868.67 - 449281.06 = 4587.6 

1 rv 

J 

1 ? G 2 

B yy = — > B 2 -= 80431355.33-78432666.67 

yr V" 1 rv 

j 

= 1998688.67 

B xy = -Y BE. - : —— = 5954030.67 - 5936186.67 

v 1 ,x rv 

j 

= 17844.00 

1 x g 2 

T xx = -Y T jx -— = 450374.00-449281.07 =1092.93 

r 4- 1 rv 

I 

1 G 2 

T yy = -Y T; 2 -= 78581240.40-78432666.67 

rv 

I 

= 148573.73 
1 \ G G 

Txy = -Y Ti 2 T ix -—i = 5944483.4-5936186.67 = 8296.73 

r ' rv 
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g2 

G™ = IIX?-*■ = 459354.00 - 449281.07 = 10072.93 

J rv 

q2 

G vv = IIYif-= 81001822.00 - 78432666.67 = 2569155 33 

‘ rv 

G xy = X ij Y ij ~~~ = 5979360.00-5936186.67 = 43173.33 

Exx = Gxx ~ Bxx - T xx = 10072.93 - 4587.60 - 1092.93 = 4392.40 
Eyy = Gyy-Byy - T yy = 259155.33 - 1998688.67 - 148573.73 
= 421892.93 

E xy = Gxy-Bxy-T^ 43173.33 - 17844.00 - 8296.73 = 17032.6 
Now complete the Analysis of variance table given below 


ANCOVA 


Source 

d.f. 

Sxx 

Sxy 

Syy 

Blocks 

5 

4587.6 (Bxx) 

17844 (Bxy) 

198688.67 

T reatment 

2 

1092.93 (T xx ) 

8296.73 

(Byy) 

148573.73 

Error 

10 

4392.40 (E xx ) 

(Txy) 

17032.6 

(Tyy) 

421892.93 

Total 

17 

10072.93 

(Exy) 

43173.33 

(Eyy) 

2569155.33 

T reatment 
+Error 

12 

5485.33 (E\ x ) 

25629.33 

(E’x.) 

570466.66 
_(E '„) 


Step-ll calculation of adjusted error mean square 


3 ^ 17 032,60 

E xx 4392.40 


Adjusted Error mean square E = E yy 



2 


E 


XX 


= 421892.93 - ll ? Q32 - 6 f 
4392.40 


355844.87 


Adjusted Error mean 
degree of freedom) 


square under the null hypothesis (with 11 
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570466.66 


(25329.33) 2 
5485.33 


453504.69 


Adjusted treatment SS = Ej - E = 5485.33 - 355844.87 
= 97659.81 (with 2 d.f.) 


Adjusted Analysis of variance 


Source 

variation 

of d.f. 

SS 

MS 

F 

T reatment 

2 

97659.81 

48829.9 

1.235 

Error 

9 

355844.87 

39538.31 



Table value of F at 5 % level of significance with 2 and 9 d.f. is 
4.26. So we infer that treatments do not differ significantly among 
themselves 


Step-lll: Calculation of Adjusted treatment means. 
The adjusted treatment means are given by 
T { =yj -p(Xj -x..) 


T x = 2192.00 - 3.87 (161.00 - 173.07) = 2238.71 

T 2 =2219.30 T 3 =2402.02 

Now s 2 = Adjusted error mean square = 39538.31 and the 
effective error mean square 


/ 


s 


2 

e 



1 + 


T. 


XX 


,1 


(v-1)E xx 'J 


= 39538.31 


SE f 


1 + 1,092.93^ 1 = 44457 _ 33 


2x4392.40'J 


' 2 s 2 e' 


2x44457.33 


6 


= 121.73 


If the F value is significant then C.D. value can be calculated in 
the usual way. 


□ n □ 



Chapter -12 


Multi-Locational Experiments 


The purpose of research carried out at experimental stations is to 
formulate recommendations for the practitioners which consists of a 
population quite extensive either in space or time or both. Therefore it 
becomes necessary to ensure that the results obtained from 
researches are valid for at least several seasons in the future and 
over a reasonably heterogeneous space. A single experiment will 
precisely furnish information about only one place where the 
experiment is conducted and about the season in which the 
experiment is conducted. It has thus become a common practice to 
repeat on experiment at different places or over a number of 
occasions to obtain valid recommendations taking in to account place 
to place variations over time or both. In such cases the object of 
interest in analyzing a set of trials is, to estimate the average 
responses to given treatments and to test the consistency of these at 
different places and in different seasons. The test of absence or 
presence of interactions helps us to know whether or not the 
response is consistent from place to place. 

The result of analysis may belong to any one of the following four 
types. 

(1) Experimental errors homogeneous and the interaction 
absent. 

(2) Experimental errors are homogeneous and the interaction 
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present. 

(3) The error heterogeneous and the interactions absent and 

(4) The error heterogeneous and the interaction present. 

Thus while analyzing the data we first determine whether the 
experimental errors are homogeneous or not. 

Test of homogeneity of experimental errors and treatment 
comparisons 

If the magnitude of the experimental errors could be assumed to 
be the same at each place, a simple arithmetic mean of the 
treatment means can be taken to provide an estimate of treatment 
effects over the tract represented by the places and a simple analysis 
of variance on two way table showing treatment and place means 
would give the required information. 

It experimental error differ from place to place, the simple mean 
and analysis of variance can not be used. Therefore, it is necessary 
to examine first the homogeneity of error variances by using Bartlett’s 
test. 

The data are first analyzed in usual way for each place (or year) 

separately. Let there be p mean squares Si 2 , s 2 2 , . Sp 2 

based on respectively n 1f n 2 ,.n p degree of freedom. Here p is 

the number of places or years on which experiment is performed. 
From these values a pooled estimate of variance s 2 is calculated. 

P 2 

ZrijSi 2 

s 2 = —- 

P 

Zn; 

1=1 

Next homogeneity of error variance is tested using Bartlett’s x 

test. 






p 

Srij loge 

i=1 



1 + 


_ 1 _ 

3(p -1) 



bin. 


J 


,1 


1 



IJ 


ix 2 2 

" Xp-i > X table value at a percent level of significance at (p-1) 

d.f then it is concluded that variances are significantly heterogenous 
otherwise error variances are said to be homogenous. 

If the error variances are homogenous, then pooled analysis is 
carried out. The analysis of variances has the following setup for p 
randomized block experiments in t treatments conducted at p sites. 


ANOVA 


Source 

d.f 

M.S. 

Places 

(p-i) 

S 2 d 

T reatments 

(t-D 

——-- 

s 2 t 

Treatment x places 

(i-i) (p-i) 

s 2 

Pooled error 

sum of error d.f of all 
the places 

-s 2 r - 


The error obtained from individual trials are pooled to obtain a 
joint estimate of error variance s 2 e. 


Testing the significance of treatment x place interaction 

In order to see whether treatment x place interaction is significant 
i.e. whether the differences between treatments tends to vary from 
place to place, the mean square for treatment x place is compared 
with the estimate of error variance by F test. If this mean square is 
found to be non significant the interaction is considered to be absent. 

It interaction is assumed to be non-existent the SS for treatment 
X places and the SS for error can be pooled to obtain a more precise 
eshmate of error for testing the significance of treatment and place 
erences. In case treatment x place interaction can not be taken as 
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zero or in other words if the interaction between treatment and place 
is found to be significant, then the appropriate mean square for 
testing the significance of treatment as well as place differences is 
the mean square due to treatment x place. 

We can test the significance of treatment mean square by 
treatment x place interaction. Similarly significance of places means 
can also be tested but usually it has no importance. This way we can 
analyse the data where (i) the errors are homogenous and interaction 
significant (ii) errors are homogenous and interactions non 
significant. 

When the error variances are hetrogeneous, the procedure to be 
followed for testing the treatment difference depends on the 
presence or absence of the treatment x place interaction. The next 
step to be followed therefore consists in making a test of significance 
for the interaction. This is done by method of weighted analysis, the 
weights for each experiment being calculated as 

Wj = r/s 2 j. For these weights, we calculate for each case the 
quantities Wj, p, and Iw^ where pi are the place total for each 
treatment and tjj are means for j-th treatment at i-th place. The 
calculation are given in the following table. 


Places 


T reatment 

s 

1 

2. 

i. 

p 

E w, tjj 

1 

tn 

t21. 

■ tj-J. 

Vi 

-M 

rf 

2 

tl2 

t22. 

tj 2 . 

t p2 

I W,t i2 

i 

j 

tlj 

*2j. 

t'i. 

tpj 

£ WjVj 
j 

T 

tit 

t2t. 

... t it . 

v. 

I w,t lt 

i 

Wj = r 

Wt 

w 2 . 

... Wj. 

Wp 


~s? 






p 

ii 

— M 

Pi 

P2. 

••• Pi. 

• Pp 


Si = I t,j 2 

i 

Si 

S 2. 

* • • Sj. 

. Sp 
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We then compute the following C.F. 

G.T. = IQ: Wjtjj) 
j ' 


(G-T .) 2 

tV Wj 


Where 


Total SS = 


p 


Iw iSi 

i=l 


- CF ; Treatment SS = 


KSw ^) 2 
j j _ 

X Wj 


-CF 


SS due to place (or years) = 


7 Zw iP 2 -CF 

t i=l 


SS due to interaction between treatments and places 

= Total SS - Treatment SS - SS due to places. 

For testing the significance of interaction, we transform the SS 
due to interaction in to x using the formula. 

2 (n-4)(n-2) _ . . . x 

X = ---- (SS due to interaction) 

n(n +1 - 3) 


with d.f. = 


(p-1)(t-1)(n-4) 
(n +1 - 3) 


Where n is the error degree of freedom on which the error mean 
square is based in each experiment. If the interaction is present set 
down the treatment means of each places in two-way table and carry 
out a simple analysis of variance and compare the treatment MS with 
interaction MS. The procedure is known as unweighted analysis. The 
simple arithmetic mean of treatment means here provide the 
estimate of the treatment effect over the tract. It is an approximate 
method 

When the interaction is absent, no general test tor over all 
treatment differences appears to be available, but we can test the 
individual treatment means by the following procedure. 

From each trial set down the responses correspondingly to the 
treatment means in which we may be interested. The following 
weighted mean is then computed. 
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Sw.y* 


y w 


iw; 


where w' = —-— and y is the response for i 1h trial 

v (y i) 


2 • 


X is now given as 


xfp-,) =(P-1) + ,7-^— -6-(P-i) 

V(n-1)i n 


wheree = Sw'y i 2 -y w Zw'y j 

i i 


2 

It x is significant we test the treatments against interaction as in 
the case of unweighted analysis. However if x 2 is non-significant, 
then we compute the statistic. 



which follows the t-distribution with p(n- 1 ) degree 


of freedom. 


Example: A set of trials was conducted at four locations in U.P. 
to compare four treatments. At each place randomized block design 
was used to carry out the experiment. The analysis of each trial 
individually provided an error mean square based on 12 degrees of 
freedom. The total yield in kg for each treatment over five replicates 
at each place as well as error mean squares estimated in each trial 
are given below. 


Blocks 


Treatments 

1 

II 

III 

IV 

1 

146,5 

177.6 

121.5 

155.3 

2 

155.7 

240.0 

150.0 

177.2 

3 

141.8 

158.4 

113.8 

201.4 

4 

80.9 

241.6 

131.9 

207.0 

EMS 

78.17 

28.31 

108.0 

67.9 


Analyse the data and draw conclusions. 


Analysis: 

Step-1: Testing the homogeneity of error variance using Bartlett’s 
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X - test 

s 2 = -£s 2 =70.60 
P i 

where s 2 j are error mean square based on 12 d.f. each. 

2 n[plog e s 2 -Ilog e Si 2 ] 

X(p “ 1) 1 + [(P +1 )/3np] 

2 _ 12[4log, 0 70x 2.3026-2.3026x7.21012] 
X3 ~ 1 + [5/(3x12x4)] 

12x0.42616 „ „„ 

=-= 4.94 

1.3047 


The table value of x at 5 per cent level of significance for 3 
degree of freedom is 7.815. Since calculate value of x ' s l ess than 
the table value of x, we conclude that error mean squares are not 
significantly different from each other. Hence we conclude that error 
variance are homogeneous. 


Step-ll Combined Analysis of the data 


We now make a combined analysis of data treating it at a two- 
way classification in an ordinary repeated trial. 

Blocks 


Treatments 

1 

II 

III 

IV 

Total 

1 

146.5 

177.6 

121.5 

155.3 

600.9 

2 

155.7 

240.0 

150.0 

177.2 

722.9 

3 

141.8 

158.4 

113.8 

201.4 

615.4 

4 

80.9 

241.6 

131.9 

207.0 

661.4 

Total 

524.9 

817.6 

517.2 

740.9 

2600.6 


C.F.= ( 26Q °' 6> - =84539.00 
80 


451498.86 
Total SS =- 

5 


84539.00 = 5760.77 


SS due to places = 88020.92 - 84539.00 = 3481.92 
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SS due to treatments = 84991.62 - 84539.00 = 452.62 
SS due to treatments interactions 
= 5760.77 - 3481.92 - 452.62 = 1826.23 
Pooled error MS = 78.23 + 28.31 + 107.94 + 67.90 = 282.38 


ANOVA 


Source 

D.F. 

S.S. 

M.S. 

F 

Places 

3 

3481.92 

1160.64 

4.11* 

Treatment 

3 

452.62 

150.54 


Interaction (Places 

9 

1826.23 

220.91 


x treatment) 





Pooled error 

64 


282.38 


Total 

79 

5760.77 




Indicate significant at 5% level. 


The F ratio for interaction is 220.91/282.38=0.7823.The value is 
non significant. Assuming interaction to be non significant we can 
pool the sum of squares for interaction and for error and obtained a 
more precise estimate of error. 

(9x220.91+48x282.38)/(9+48)=272.67. Test the treatment mean 
square against pooled estimate of experimental error we get 
F=150.54/272.67=55 which shows the average treatment differences 
over the different places is non significant. 

Example: With a view to studying the effect of use a spraying on 
yield of wheat, an experiment was conducted at Agricultural 
Research Station Arnaj in Gujrat State during 1960-61, 61-62, 62-63 
and 63-64. The experimental sites were choosen a fresh every year. 
The design adopted was randomized block with four replications. The 
treatments were 

N 0 •' Control; : 5.60 Kg/h of N of urea; N 2 : 11.21 Kg/h of N as 
urea, N 3 . 11.21 Kg/h of N as A/S + 5.60 Kg/h N as urea; N 4 : 11.21 
Kg/ha of N as A/S + 11.21 Kg/ha of N as urea. Ammonium sulphate 
was supplied as basal dose and urea applied by spraying in two 
equal doses. The mean yield in Kg/h is given below: 
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Years 


Treatment 

1960-61 

61-62 

62-63 

63-64 

N 0 

863 

1442 

1248 

696 

Ni 

882 

1369 

1017 

712 

n 2 

854 

1407 

1014 

756 

n 3 

860 

1363 

1199 

943 

n 4 

850 

1406 

1307 

955 

EMS 

2106.81 

43430.56 

27989.29 

5069.44 


Analyse the data and draw conclusions. 

Calculation 

Testing the homogeneity of error variance using Bartlett’s test. 

We calculate the following 

S 2 =-ISj 2 = 19649.025; xl = 28.443 
Pi 

The table value of x at 5 P er cent level °f significance for 3 
degree of freedom is 7.815. Since calculated value of x is greater 
than table value of x- We conclude that error mean squares are 
significantly different from one another. Hence we can infer that error 
mean squares are heterogeneous. We shall therefore, analyse the 
data using weighted method of analysis. 


Calculations of various sum of squares. 

Years 


Treatment 

1 

2 

3 

4 

L Wjtjj 

No 

863 

1442 

1248 

696 

2.49404 

Ni 

882 

1369 

1017 

712 

2.50387 

N ? 

854 

1407 

1014 

756 

2.48843 

n 3 

860 

1363 

1199 

943 

2.66950 

n 4 

850 

1406 

1307 

955 

2.67890 

Pi 

4309 

6987 

5785 

4062 


ri 

Wj 

0.00190 

0.00009 

0.00014 

0.00079 


WjDj 

8.18416 

0.62883 

0.80990 

3.20898 


Si 

3714109 

9767779 

6765839 

3364170 
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G = Z(Z W,t n ) = 12.83474; C.F.= ^ G ' T-) = 11282.91 
J i J tlw, 

i 

Total SS = IWiSi -C.F. = 11540.82-11282.91 =257.91 

1(1 W^) 2 

Treatment SS = ——'- CF 

EWi 

i 

= (32.98462/0.00292) - 11282.91 = 13.19 

1 P 9 

SS for years = - £ wp 2 - CF 

t i-1 

= (57391.9973/5) - 11282.91 = 195.49 
SS due to interaction between treatments and year 
= Total SS — Treat SS — SS for year 
= 257.91 -13.19-195.49 =49.23 
Test for Interaction: 

For testing the significance of interaction, transform sum of 
squares for interaction in to using the formula. 

X = [( n- 4) (n-2)]/ [n (n+t-3)] ( SS due to interaction), with 
d-f. = [(p-1) (t-1) (n-4)]/ (n + t-3) 

Thus x 2 = [8 x 10 x 49.23]/[ 12(12 + 5-3)] = 23.44 with 7 d.f. 
(approx.) 

The table value of x at 5 per cent level of significance for 7 d.f. is 
14.067. Since calculated value of z 2 is greater than table value of x 2 , 
it can be inferred that interaction is present. Since interaction is 
present we shall carry out the unweighted analysis. 

Unweighted Analysis: 

C.F. = 447026449/20 = 22351322.45 
Treatment SS = (89609811.00/4) - 22351322.45 = 50905.30 
SS for years = (117351719/5) - 22351322.45 = 1119021. 535 
Table SS = 1260 574.55 
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SS due to years x treatment interaction = Table SS - Treat SS - 
SS for years 

= 1260574.55 - 50905.30 - 1119021.35 = 90647.90 

Analysis of variance: 

_ ANOVA _ 

Source _ D.F. _ S.S. _M 1 S : _ F 

Treatment 4 50905.30 12726.33 1.68 

Years 3 1119021.35 373007.11 49.38** 

Treatment x years 12 90647.90 7553.99 

Pooled error _ 48 78596.10 1637.42 _ 

Since year component comes out to be significant, we can 
conclude that the season has a significant effect on the trials. 

□ n □ 


Chapter -13 


Balanced Incomplete Block Design 


When the number of treatment to be tested is more than the 
block size, the complete block design are to be abandoned for 
experimentation and incomplete block designs are needed in such 
situations. In incomplete block designs, the block size is less than the 
number of treatments to be tested. These designs were introduced 
by Yates in order to eliminate heterogeneity to a greater extent than 
is possible with Randomized Block Design and Latin Square Design 
when the number of treatment is large. Among incomplete block 
designs, the Balanced Incomplete Block Designs and Lattice Designs 
are commonly used by agricultural and biological workers in plant 
breeding and other trials. 

Balanced Incomplete Block Design (BIBD) 

A BIBD is an arrangement of v treatment arranged in b blocks 
each of size k such that. 

1. Every treatment occurs atmost once in a block. 

2. Every treatment occurs exactly in r blocks. 

3. Every pair of treatment occurs together exactly in X blocks, v, 
b, r, k, X are known as the parameter of BIB design and they 
satisfy the relations. 

(i) vr = bk (ii) A.(v-1) = r (k-1), (iii) b < v 

The BIB design is said to be symmetrical if v = b and r = k. 




Blocks 


(j) 

J 

1 

2 

3 

4 

(ii) 

1 

2 

3 

5 

_ Y 11 -- 

(iii) 

1 

2 

4 

5 

MILL _ 

(iv) 

1 

3 

4 

5 

_ V -2- 

M _ 

2 

3 

4 

5 


The blocKS ana me irecumcuio ... 

in experimental layout. The parameter combinations along with the 
plans of the BIB designs in the practically useful range are given in 
the statistical tables by Fisher and Yates (1973), Hafner Pub. Co., 
New York. 


Statistical Analysis 

The mathematical model for BIBD (Intra Block Analysis) 

Yjj = |! + tj + bj + 6ij 

Where y, is the observation recorded on the i th treatment in the j 
block (i = 1 , 2 , .. v, j = 1 , 2 , ... b) 
p is the general mean 
tj is the effect of the i ih treatment 
b, is the effect of the j th block 

e are the random error components assumed to be distributed 
normally about mean zero and constant variance a 2 

In the intra block analysis we assume that the treatment effects 
and the block effects are fixed though unknown and ey’s are 
uncorrelated random variables. On minimizing the error sum of 
squares we get a set of normal equations which can be solved to get 
the estimate of different contrasts of various estimated treatment and 
the block effects. 


we compute 

G = Grand total of observations 

Y = Grand mean = G/n where n = vr = bk total number of 
observations. 
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j. = Sum of observation for treatment i, (i = 1, 2... v) 

= Sum of observation in block j, (j = 1,2.b) 

C.F.= G 2 /n 

q. = Adjusted total for the i lh treatment 

= Tj-[Sum of block totals in which treatment i occurs/block 
size (k)] 

The i ,h effect = (kQj)/?LV i = 1,2.v. 

Adjusted treatment mean for i ,h treatment 
= j ,h treatment effect (tj) + grand mean (y). 

=Ql + y where E = X v/rk and is called the efficiency factor of 
rE 

BIBD. 

Various sum of square can be obtained as follows. 

( 1 ) Total sum of squares (TSS) = I (observation ) 2 - C.F. 

(2) Treatment Sum of squares unadjusted (SST U ) = [XT 2 /r] -C.F. 

(3) Block sum of squares unadjusted (SSB U ) = [EB 2 /k] -C.F. 

(4) Treatment sum of square adjusted (SST a ) 

= It:Q:=Y — Q, 2 = I(kQ|) 2 avk 

1 1 1 ' 


The treatment sum of square can be calculated as 


Treatment 

Number 

Treatment 

total (Tj) 

Blocks 

where i th 

treatment 

occurs 

Sum of the blocks 
containing i lh 

treatment 

j(i) 

kQi = 

kTj 

J(i) 

(kQi) 2 

1 

2 
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(5) Error SS (SSE) = Total SS-Block SS (unadjusted)-Treat SS 
(adjusted) 


ANOVA for a BIBD (v, b, r, k, X) 

Source DF 

SS 

MS 

F 

Block b-1 

SSB U 



(unadjusted) 
Treatment v-1 
(adjusted) 

SST a 

MST 

MST C/ < u. u 

-F(v -1, bk - b- v + -h 

MSE ' 

Error (Intra bk-b- 

By 

MSE 


block) v+1 

subtraction 

SSE 



Total bk-1 

TSS 




Here MST = SST a /v-1 and MSE = SSE/(bk - b-v+1) 

As usual if F ca i < F, abf then treatments do not differ significantly. 
But if F ca , > F ta s, then treatments differ significantly and we can 
proceed further to find the C.D. value. 


Coefficient of variation = 


' M JIx100 

Y 


S.E. of different between two adjusted treatment means = 




2MSE 


rE 


CD at 5 % — t(a, error d.f.) * 


'2MSE 


rE 


In case one is interested to make inferences about block effects, 
the adjusted block mean square is to be compared with error mean 
square. The adjusted block SS can be computed using the following 

identify. 

SS due to blocks adjusted (SSB a ) + SS due to treatments 
unadjusted (SST U ) 

= SS due to Blocks unadjusted (SSB U ) + SS due treatment 
adjusted (SST a ) 

Here SS due to treatment unadjusted (SST U )= iTj / r -C.F. 
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Analysis with recovery of inter block information 

In the intra block analysis it is assumed that the treatment and 
block effects are fixed. The treatment effect may be treated as fixed 
but the block effect should be treated as random variable because 
the block effect in an incomplete block design depend upon the set of 
treatments received by the block. Therefore, in BIBD, between block 
variation on the block comparisons provide error variance. 

Also the block comparisons contains some information on 
treatment comparisons (as some set of treatment occurs in different 
blocks) which can be extracted, if we can estimate the variance 
between blocks with in a replicate which are treated alike. 

If the Inter block information is recovered, the adjusted treatment 
mean is equal to the unadjusted treatment mean plus a correction. 

If E b > E e the adjusted totals can be obtained a Tj = T, + pw, 

-- (b -1) (E b -E e ) 


Where p = 


v(k — 1 )(b — 1 )E b + (v-k)(b-v)E ( 


and Wj = (v-k)Tj - (v-1) y^Bj +(k-1)G 


Then S.S. due to treatment adjusted 


ZT' 2 (IT if 


and MS due to treatment (adjusted) 

= _l[m!-MI 1 

v-1 r vr j 

When inter block information is recovered an approximate F-test 


F [(v-1), bk-v-b+1) d.f. ] 

_ MS among adjusted means 
average effecitve error variacne 




Balanced Incomplete Block. Design 

Effective error variance = E e [1+ (v-k)p] 
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Example 

An experiment on 13 varieties in paddy was conducted at 
Agricultural Research Institute, Rajendra Nagar, Andhra Pradesh, to 
find out the best variety suitable for the tract. The experiment was 
laid out in BIB design with 13 blocks of 4 plots each and each 

treatment replicated four times. The Yield data is given as: __ 

Yield of paddy (in kg/ha) 

Blocks 


1 

23.6 

18.9 

28.0 

24.3 

II 

19.5 

32.3 

22.0 

21.8 

III 

30.7 

725.3 

20.3 

17.2 

IV 

25.3 

26.0 

36.6 

18.4 

V 

20.4 

32.5 

31.8 

33.4 

VI 

32.6 

31.4 

28.2 

32.5 

VII 

33.7 

31.2 

26.7 

31.6 

VIII 

33.5 

32.5 

32.3 

35.9 

IX 

37.2 

31.9 

38.4 

30.3 

X 

27.7 

29.7 

25.9 

34.5 

XI 

36.0 

30.1 

31.1 

29.4 

XII 

32.8 

32.7 

28.8 

40.7 

XIII 

31.3 

30.5 

39.5 

27.7 


Analyse the data 

(i) without recovery of intra block information. 

(ii) with recovery of inter block information. 

Calculations: 

The parameters of the given design are 
b = 13 = number of incomplete blocks. 
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v = 13 = number of varieties of paddy. 

k = 4 = number of plots per block. 

r = 4 = number of times a variety is replicated. 

X = 1 = number of blocks in which a pair of treatment occurs 
together in the given design. 


(i) Calculation of Qji 


\ / 

Sr. 

No. 

Treatment 
total Ti 

Block 
Total Bj 

Serial No. of 
block where 
i ,h treatment 

occurs 

Sum of the 
block which 
contain i ,h 
treatment 

2> 

j(i) 

kTi 

KQi= 

(6M5) 

(kQ.) 2 

. 

1 

2 

3 

4 

5 

6 

7 

8 

1 

1397 

94.8 

7,9,11,12 

521.6 

558.8 

37.2 

1383.8 

? 

116.2 

65.6 

4,9,10,13 

490.9 

464.8 

-26.1 

681.2 

3 

118.2 

93.5 

1,2,8,9 

462.4 

472.8 

10.4 

108.2 

4 

111.9 

106.3 

2,6,10,11 

463.7 

447.6 

-16.1 

259.2 

5 

121.1 

118.1 

4,6,7,8 

488.4 

484.4 

O 

i 

16.0 

6 

110.3 

124.7 

1,6,12,13 

483.5 

441.2 

-42.3 

1789.3 

7 

122.3 

123.2 

5,8,11,13 

506.9 

489.2 

-17.7 

313.3 

8 

130.2 

134.2 

2,4,5,12 

455.0 

520.8 

65.8 

4329.6 

9 

112.4 

137.8 

1,5,7,10 

453.9 

449.6 

-4.3 

18.5 

10 

113.4 

117.8 

3,5,6,9 

474.1 

453.6 

-20.5 

420.3 

11 

91.7 

125.6 

1,3,4,11 

420.2 

366.8 

-53.4 

2851.6 

12 

111.8 

135,0 

2,3,7,13 

441.3 

447.2 

5.9 

34.8 

13 

136.4 

129.0 

3,8,10,12 

480.5 

545.6 

65.1 

4238.0 


1535.6 

1535.6 


6142.4 


0.0 

16443.8 


C.F. 



< 1535 ' 6 J- =45347.4 
13x4 


Total SS 



46932.20 - 45347.40 = 1584.8 
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= 46090.2 - 45347.4 = 742.8 


X(kQi ) 5 


Treatment SS (adjusted) 
16443.8 


vk/l 


4x1x13 


= 316.2 


Treatment SS (unadjusted) = — C.F. 


= 45814.6 - 45347.4 = 467.2 

Error SS (Intra Block) = Total SS - Treatment SS (adjusted) 

- Block SS (unadjusted) 

= 1584.8 - 316.2 - 742.8 = 525.8 


(ii) Analysis of Variance 

ANOVA 


Source 

D.F. 

S.S 

M.S. 

F 

Block (unadjusted) 

12 

742.80 

61.9 

1.4 

Treatment (adjusted) 

12 

316.20 

26.40 

0.6 

Error 

27 

525.80 

43.8 

- 

Total 

51 

1584.80 




Here the treatments do not differ significantly 


Standard error of the difference between two variety means 
(adjusted) 


Efficiency factor of BIBD, E = — = — = 0.8125 

rk 16 


Analysis with recovery of inter block information 


i) Block SS (adjusted) 


Block SS (unadjusted) + Treatment SS 
(adjusted) - Treatment SS 
(unadjusted). 


= 

742.8 + 316.2-525.8 

1 -■—---- 

— 

~533.3 
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(jj) Analysis of variance 


VV -- / 

Source 

D.F. 

S.S. 

M.S. 

F~~~l 

Block (adjusted) 

12 

533.30 

44.40 E b 

1.00 ~ 

_l— — -■ ■ ■ 

T reatment 
(unadjusted) 

12 

467.20 

38.90 

0.90 

Error 

27 

525.80 



Total 

51 

1584.80 




We have E b = 44.4 and E e = 43.8 


Here E b > E e we can proceed for the analysis of inter block 
information 


(iii) Calculation of ji and Wj 

_ (b-1)(E b -E e ) _ = 12x0.60 

v (k -1) (b-1) E b + (v-k) (b-v)E e 39x12x44.40 

w, = (v-k) Ti - (v-1) £ B j + (k -1 )G 

j(i) 


0.0003 


Calculation of Wj using above equation 


Treatmen 
t No. 

Treatmen 
t total Ti 

Serial 

No. of 
block 

Sum of Wi 
the block 
totals 

T,* = 

Ti + 0 Wi 

Adjusted 

mean 

(6)/r 



where i ,h 

where i ,h 




treatment 

treatment 





occurs 

occurs 



1 

2 

3 

4 5 

6 

7 


1 . 

2 . 

3 . 

4 . 

5 . 

6 . 

7 . 

8 . 

9 . 

10 . 
11 . 
12 . 
13 . 


139.7 
116.2 
118.2 
111.9 
121.1 

110.3 

122.3 
130.2 

112.4 

113.4 

91.7 
111.8 

136.4 


7 . 9 . 11.12 

4 . 9 . 10.13 

1 , 2 , 8,9 

2 , 6 , 10,11 

4 . 6 . 7.8 

1 . 6 . 12.13 

5 . 8 . 11.13 

2 . 4 . 5.12 

1 . 5 . 7.10 

3 . 5 . 6.9 

1 . 3 . 4.11 

2 . 3 . 7.13 
3 , 8 , 10,12 


521.6 

490.9 

462.4 

463.7 

488.4 

483.5 

506.9 
455.0 

453.9 

474.1 

420.2 

441.3 

480.5 


841.5 
75.0 
- 3.0 

242.7 

- 116.1 

305.1 
162.9 
- 471.0 

223.2 

184.2 
1030.5 

246.6 
- 712.8 


139.4 
116.2 
118.2 
112.0 
121.1 

110.4 
122.3 

130.1 

112.5 

113.5 
92.0 

111.9 

136.2 


34.9 

29.1 

29.5 
28.0 

30.3 

27.6 

30.6 
32.5 

28.1 

28.4 
23.0 
28.0 
34.0 
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Treatment SS (adjusted) 



= 45803.70 - 45347.40 = 456.30 


M.S. due to treatment (adjusted) 


456.30 

12 


38.02 


F (12 and 27 d.f.) 


M.S.(adjusted) due to treatment 
Average effective error variance 


38.02 

43.93 


0.86 (NS) 


where average effective error variance = E e [1+ (v-k)ji] 

= 43.80 x 1.003 = 43.93 

The treatment effects are not significantly different among 
themselves. 


n □ □ 


Chapter -14 


Lattice Designs 


Balanced incomplete block (BIB) designs are the most efficient 
designs in the class of binary incomplete block designs but these 
designs require usually a large number of replications and are not 
available for all combinations of parameters values. To over come 
these difficulties Yates (1936) introduced a class of designs known 
as Lattice Designs. These designs are most commonly used in 
agricultural research. There is sufficient flexibility in the design to 
make its application simpler than most other incomplete block 
designs. The characteristic features of these designs are that the 
number of treatment is a perfect square and the block size is the 
square root of this number. We have balanced lattice and the partially 
balanced lattice designs. Both require that the number of treatments 
must be a perfect square. 

Balanced lattice 

The balanced lattice design is characterized by the following 
basic features. 

1. The number of treatment (v) must be a perfect square (i.e. v = s 2 , 
such as 25, 36, 49, 64, 81, 100 etc.). As the number of treatment 
becomes large, adding a two more or eliminating some less 
important treatments is usually easy to accomplish. For example 
if a plant breeder wishes to test the performance of 80 varieties 
in a balanced lattice design, all he needs to do is add one more 
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variety for a perfect square or has 82 or 83 varieties to start he 
can easily eliminate one or two. 

2. The block size (k) is equal to the square root of the number of 
treatment (i.e.k = Vv ). 

3. The number of replications (r) is one more than the block size 
[i.e. r = (k+1)]. The number of replications required is 6 for 25 
treatment, 7 for 36 treatment and so on. 

Randomization and layout 

We illustrate the randomization and layout of a balanced lattice 
design with a field experiment involving nine treatments. There are 
four replications, each consisting of three incomplete blocks with 
each block containing three experimental plots. 


Step-1 Divide the experimental area in to r = (k+1) replications, 
each containing v = k 2 experimental plot. 



Step-ll Divided each replication in to k incomplete blocks each 
containing k experimental plots. 


Step-lll Select a basic balanced lattice plan corresponding to the 
number of treatments to be tested. For our example, the basic plan to 
3x3 balanced lattice design is 


Incomplete 

Block 

Number 

Treatments Numbers 

m 

1 

HI 

1 

Rep. Ill 

1 

Rep. IV 

1 

i 

2 

3 


1 

D 

D 


1 

5 

9 

□ 

1 

6 

8 

2 

D 

5 

6 


2 

5 

8 


2 

6 

7 


2 

D 

9 

3 

7 

8 

9 


3 

6 

9 


3 

4 

8 


3 

5 

7 


Step IV Randomize the replication arrangement of the selected 
basic plan following an appropriate randomization scheme. The 
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outcome of new plan at this step is 


UUIUVJI V 

Incomplete 

Block 

Number 

Treatments Numbers 




1 

HI 

1 

Ri 

ep. 1 

II 


R< 

-1 

;p. 1 

-1 

V 

1 


a 

7 

■ 


2 

3 

■ 


5 

9 




8 

2 


H 

8 


a 

5 

6 


2 


7 


2 

D 

6 

3 

3 

6 

9 


LZ_ 

8 

9 


3 

□ 

8 


3 

l_5_ 

7 


Step V Randomize the incomplete blocks with in each 
replication. After four independent randomization processes, the 


reassigned incomplete blocks may be shown a. 


111 

Incomplete 

Block 

Number in 

Basic Plan 

Reassigned Incomplete Block Number in New Plan 

Rep. 1 


Rep. II 


Rep. Ill 


Rep. W 

-1- 


1 

■ 

3 

■ 

■ 

■ 

2 

■ 


■ 


| 

— 

Hi 


■ 

2 

■ 

2 

■ 

■ 

■ 


■ 

■ 

■ 


■ 

■ 

■ 

3 

■ 

3 

■ 

1 


■ 

■ 

3 


■ 

■ 

1 2 


■ 

■ 

2 

■ 


The out come of the new plan at this step is. 


Incomplete 

Block 

Number 

Treatments Numbers 

Rep. 1 


Rep. II 


Rep. Ill 

— 

i 

Rep. IV 

1 

3 

6 

9 


B 

5 

6 



5 

7 



5 | 

9 

2 

2 

5 

8 



2 

3 



6 

8 



1 4 

1 8 

3 


D 

7 


7 

8 

9 


2 

B 

_9_ 

_ _ 

IB 

B 



Step-VI: Randomize the treatment arrangement with in each 
incomplete block, the reassigned treatment sequence may be shown 
as: 


Incomplete 

Block 

Number 

Treatments Numbers 

Rep. 1 


Rep. II 


Rep. Ill 


Rep. IV 

1 

9 

3 

6 


5 

B 

6 


7 

5 

3 


uU 

9 

— 


2 

8 

5 

2 


3 

2 

1 


6 

8 

1 


IB 



3 

7 

1 

B 


9 

8 

7 


2 

B 



IB 


UlI 











































































































Block 1 
Block 2 
Block 3 


|t 9 

t 3 

t 6 

00 

1— 

t 5 

t 2 

t 7 

Ti 

t 4 


t 5 

t 4 

CD 

1- 

t 3 

C\J 

Ti 

t 9 

—1 

00 

t 7 


Replication I Rep. II 


— 1 

"'si 

to 

H" 

H 

CO 

t 6 

t 8 

Ti 

t 2 

1- 

t 9 


Ti 

t 9 

t 5 

t 4 

00 

h- 

t 3 

t 7 

t 2 

t 6 


ReD. 


Ren. IV 


Here important feature in the layout of balanced lattice design is 
that every pair of treatment occurs.together only once in the same 
block. For example, treatment 1 appears only once with treatment 4 
and 7 in block 3 of replication I, with treatment 2 and 3 in block 2 of 
replication in II with treatment 6 and 8 in block 2 of replication III and 
with treatment 5 and 9 in block I of replication IV. As a consequence 
of this feature, degree of precision for comparing each pair of 
treatment in a balanced lattice design is the same for all the pairs. 


Analysis: There are four sources of variation that can be 
accounted for in a balanced design: replication, treatment, 
incomplete block and experimental error. As compared to RBD, the 
incomplete blocks is an additional source of variation and reflects the 
differences among incomplete blocks of the same replication. 


The data of the 16 rice fertilizer treatments in 4 x4 balanced 
lattice designs with five replications given below has been used for 
computational procedure for the analysis of variance. 


Tillers number per square meter from 16 fertilizer treatments 


tested in a 4 x 4 Balanced Lattice Design. 


Blocks 

Numbers 

Tillers No/m 2 

Block 

Totals 


Rep 1 


1 

BIBM 


ism 

IB 

616 

2 

IE3S9 


mm 


616 

3 



BBBM 

li£)iua 

616 

4 




RH 

747 

Rep. Total Ri 

2595 
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Tillers No/m' 


Rep. II 


(9) 


(13) 

152 


( 6 ) 

142 




783 


2729 


Blocks 

Numbers 


Rep. Total R 3 


Tillers No/m' 


Rep III 


Block 

Totals 


9 

(D 

(6) 

(11) 

(16) 



155 

162 

177 

152 

646 

10 

(5) 

(2) 

(15) 

(12) 



182 

130 

177 

165 

654 

11 


(14) 

(3) 

(8) 



137 

185 

152 

152 

626 

12 

(13) 

(10) 

(7) 

(4) 




122 

182 

192 

681 


2607 


Blocks 

Numbers 


13 


14 


15 


16 


Rep. Total R 


Tillers No/m' 


Rep. IV 



Block 

Totals 


802 


724 


675 


689 


2890 
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Blocks 

Tillers No/m 2 

Block 

Numbers 

Rei 

pV 

Totals 

17 

(D 

147 

(10) 

112 

(15) 

177 

(8) 

147 

583 

18 

(9) 

180 

(2) 

205 

(7) 

190 

(16) 

167 

742 

19 

(13) 

172 

(6) 

212 

(3) 

197 

(12) 

192 

773 

20 

(3) 

177 

(14) 

220 

(11) 

205 

(4) 

225 

827 

Rep. Total R 5 

2925 


The values in parenthesis correspond to the treatment numbers. 


Step-1: Calculate the block totals (B) and replication totals (R) 
treatment totals (T) and the grand Total (G). 

Step-2: For each treatment calculate the B t value as the sum of 
block totals over all blocks in which the particular treatment appears. 
For example treatment 5 occurs in blocks 2, 5, 10, 15, and 20. Thus 
B, for treatment 5 is computed is the sum of the blocks totals of the 
blocks 2, 5, 10, 15 and 20 or B 5 = 616 + 639 + 654 + 675 + 827 = 
3411. 


Note that sum of all B t values over all the treatments must be 
equal to (K) (G) where K is the block size. 

Table: Computations of the adjusted and unadjusted treatment 
totals for the 4x4 balanced lattice design. 


Treatment 

Number 

Treatment 

Total 

(T) 

1^1 

W = 

4T-5 B t + 

G 

T' = 

T+ p.W 

M' = 

T'/5 

1 

809 

3286 

552 

829 

166 

2 

794 

3322 

312 

805 

161 

3 

908 

3411 

323 

920 

184 

4 

901 

3596 

-630 

878 

176 

5 

816 

3411 

-45 

814 

163 
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1 Treatment 
Number 

Treatment 

Total 

(T) 

Block 

Total 

s 

(B.) 

W = 

4T-5 B t + 

G 

T' = 
T+ pW 

M' = 

T'/5 

6 

848 

3310 

588 

869 

174 

7 

864 

3562 

-608 

842 

168 

8 

865 

3332 

546 

885 

177 

9 

801 

3312 

390 

815 

163 

10 

581 

3141 

365 

594 

119 

11 

946 

3534 

-140 

941 

188 

12 

971 

3628 

-510 

953 

191 

13 

869 

3564 

-598 

848 

170 

14 

994 

3588 

-218 

986 

197 

15 

913 

3394 

428 

928 

186 

16 

866 

3593 

-755 

839 

168 

Sum 

■Em 

54984 

0 

” 

- 


Step-3: For each treatment, calculate 
W = KT - (k+1) B t + G 

For example the value of W for treatment 5 is computed as 

W 5 = 4 (816) - (5) (3411) + 13746 = -45 sum of W values over all 
the treatments must be zero. 


Step-4: Compute the total SS, the replication SS and the 
treatment (unadjusted) SS as. 

CF . = - g !- = < 13 ' 746 > 2 =2361906 
k 2 (k + 1) (16)(5) 

Total SS = IX 2 -CF 

= [(147) 2 + (152) 2 .+ (225) 2 ] - 2 361906 = 58856 

Replication SS = ^--CF 


(2595) 2 +(2729) 2 ..+ (2925) 2 ,1 


16 


U 


-2361906 =5946 
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IT : 


-CF 


Treatment (unadj) SS = 

(809) 2 +(794) 2 .+ (866) 2 1_ 

5 J 


2361906 = 26995 


Step-5: Compute the block (adjusted) SS [i.e., the sum of square 
for blocks with in replication adjusted for treatment effects) as: 


Block (adj) SS= 


v 

ZWj 

i=i 


(k 3 )(k + 1) 


(552) 2 +(312) 2 .+ (-755) 2 |l 


64x5 


U 


= 11382 


Step-6: Intra block error SS=Total SS-Rep SS-Treat (unadj) SS- 
Block (adj) SS 

= 58856 - 5946 - 26995 - 11392 = 14533 

Step-7: Complete the block (adj) mean square and the intra 
block error mean square as: 

Block (adj) MS = -——— = —— =759 


Intra block error MS = — - ra block error SS = 14533 = 323 

(k-1)(k 2 -1) (3) (15) 

Step-8: For each treatment calculate the adjusted treatment 
totals T' as 

T' = T + pW 

where p - Block ( ad J) MS'Intra block error 

. k 2 [Block (adj) MS] 

If the intra block error MS is greater than the block (adj) MS, p is 
taken to be zero and no adjustment for treatment is necessary. The F 
test for significance of treatment effect is then made in the usual 
manner as the ratio of treatment(unadj) MS and intra block error MS 
and steps 8 to 12 and step 15 can be ignored. 
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In our example the intra block error MS is smaller than the block 
(adj) MS. The adjustment factor p is computed as 

_ 759 - 329 = Q35g The j, va j ue for treatment 5 is computed as 
H 16(759) 

Tg / = 816 + (.0359) (-45) = 814. 

Step-9: For each treatment, calculate the adjusted treatment 
mean M' as 

T' Q 1 A 

M' = —— Here M' 5 = — = 163 
k + l 5 


Step 10: Compute the adjustment treatment mean square as 


Treatment (adj) MS = 


1 


i 


L(k + 1)(k 2 -1)|j 


ST 


/2 


G 2 ,l 


U 


.If, 


L(5) (15) J | 


S 1(829) + (805) + 


+ 


(839) 2 ]- 


(13746)' 

16 


= 1602 

Step 11 : Complete the effective error MS as 
Effective error MS = (Intra Block error MS) (1 + kp) 

= 323 [1+4 (0.0359)] =369 


CV. = / Effective error M .S.^ 
grand mean 


[369 
" V 172 


x 100 = 11.2% 


Step 12: Compute the F value for testing the treatment 
differences as: 


_ treatment (adj) MS 1602 . _. 

Effective Error MS 369 

Step 13: Compare the computed F value to the tabular F values 
with ni = (k 2 -1) = 15 and n 2 = (k-1) (k 2 -1) = 45 degree of freedom. As 
the computed F value is larger then the tabular F values (2.47) at 1% 
level of significance, the treatment difference is judged to be highly 
significant. 
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Step 14: Construct the analysis of variance table 

Source of Degree of Sum of Mean 
variation freedom squares squares 


Comp 
Comp 
uted F 


Replication 



k-1=15 


k-1=15 


5946 


26995 


11382 


Intra 

error 



block (k -1) (k 2 -1) = 45 14533 


(k-1) = 15 



Effective error 


Total 



r(k+1) -1 =79 


Step-15: Estimate gain in precision of a balanced lattice design 
relative to RBD design as: 

^ £ _ 100 [Block (adj) SS + Intra block error SS] 
k (k 2 -1) (Effective error MS) 


_ 100 (11,382 + 14,533 
4(16-1) (369) 


= 117% 


Thus the use of 4 x 4 balanced lattice design is estimated to 
have increased the experimental precision by 17% over that which 
would have been obtained with RBD. 

Partially Balanced Lattice 

The partially balanced lattice design is similar to the balanced 
lattice design but allows a more flexible choice of number of 
replications. Here also the treatment number must be a perfect 
square and the block size is equal to the square root of this treatment 
number, the number of replication is not prescribed as a function of 

ana? II hT tment '*** a " V " Umber of re P lication can be used in 
a partially balanced design. 

With two replications, the partially balanced lattice design is 
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referred as simple lattice, with three replications a triple lattice and 
with four replications a quadruple lattice and so on. Such flexibility in 
the choice of number of replications results in a loss of symmetry in 
the arrangement of treatment over blocks (i.e. some treatment pairs 
never appear together in the same incomplete block). Consequently, 
the treatment pa.rs that are tested in the same incomplete block are 
compared with a level of precision that is higher than those that are 
not tested in the same incomplete block. So there is more than one 
level of precision for comparing treatment means and data analysis 
become more complicated. 



Chapter - 15 


Layout and Analysis of Augmented Designs 


In Genetic resources environment, an essential activity is to test 
or evaluate the new germplasm/provenances/superior selections 
(new treatments with the existing provenances or released varieties 
(checks or control treatments). A problem in these evaluation studies 
is that quantity of the genetic material collected from the exploration 
trips is very limited or can not be made available since a part of this is 
to be deposited in Genebank. The available quantity of seed is often 
not sufficient for replication. Moreover the number of new germplasm 
or provenances to be tested is very high (usually about 1000-2000 
and some time up to 3000 associations) and it is very difficult to 
maintain block homogeneity. RBD can not be used as soil hetrogenity 
becomes unmanageable. In absence of error term (error variance) in 
the ANOVA, without replication, test of significance cannot be 
applied. Both these problems can be surmounted by employing 
Augmented (or Honnuiaku) designs. Because of simplicity and 
usefulness, these designs have applications in the fields of plant 
breeding, entomology, pathology, chemistry, physiology, agronomy 
and perhaps others for screening experiments on new materia! and 
preliminary testing of experiments on promising material. Thus we 
have to design an experiment in which the experimental material for 
new (test) treatments is enough only for single replication. However, 
to ensure connectness property of the design i.e. estimation of all 
possible paired comparisons and estimation of error, at least two 
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replications on all treatments are required in traditional block designs 
viz. randomized complete block design, balanced incomplete block 
designs, etc. In augmented designs, connectness is ensured by 
augmenting any standard connected design in control treatments 
with new (test) treatments and replication of the control provide the 
estimate of error. More precisely, an augmented experimental design 
is any standard experimental design in controls augmented with 
additional treatments in the complete block, incomplete block, the 
( row, the column etc. The blocks in an augmented design may be of 
unequal size. Federer (1956) proposed three models. 

1. Augmented completely randomized design (Augmented Design 

I). 

2. Augmented randomized complete block design (Augmented 
Design II). 

3. Augmented Latin Square Design (augmented Design III). 

Only the first two design (I and II) satisfying the above two 
situations are widely used by plant breeders, hence elaborated 

below: 

Augmented Design I 

Concept: When number of seeds is a limitation, the augmented 
completely randomized design (I) is most suited. In this design the 
whole experimental area is divided in to N plots (where N is equal to 
the number of test genotypes (V) + number of checks (C) which are 
standard varieties or hybrids of known performance) repeated b 
times, i.e. N = V + bC; and total number of entries, e = V+C. All the N 
plots are allotted randomly for all V and C, the later repeated b times. 
Blocks are not marked at all. A representative field design with V=8 
(V 1 to V 8 ). C = 4 (Ci to C 4 ), b = 3, e = 12 and N = 20 in a linseed trial 
is depicted below: 


Table 1: Field layout and data collection: Augmented design I 


C 3 

V 6 

C 4 

C3 

VI 

C4 

Ml 

C4 

V4 

C2 

c, 

V 3 

C2 

Cl 

C3 

V5 

Cl 

V8 

C2 
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The following data on number of tillers per plant were collected 
on 5 random plants per plot and the average over sample is 
presented in table 2 . 


Table 2: Average data on number of tillers in linseed varieties 
and checks under augmented design I 


Checks 

(Ci) 

Symbolic data 



Actual data 



Repeats (bj) 

Total 

(Toi) 

Mean 

Repeats 


Total 

Mean 

1 

2 

3 

( Ci) 

1 

2 

3 


c, 

Xi 

x 2 

x 3 

TC! 

Cl 

10 

8 

6 

24 

8.0 

c 2 

x 4 

x 5 

x 6 

Tc 2 

C 2 

3 

5 

6 

14 

4.7 

C 3 

x 7 

x 0 

x 9 

Tc 3 

C 3 

10 

9 

8 

27 

9.0 

c 4 

X 10 

Xu 

X 12 

Tc 4 

C 4 

15 

14 

18 

47 

15.7 

Total 

KSl 



Tc 

c 




112 

9.33 


Varieties (Vi) 

Symbolic data (Vi) 


Actual data 

v 1 

Xl 3 


6 

V 2 

x 14 


40 

v 3 

Xl5 


4 

v 4 

Xl 6 


24 

V 5 

X 1 7 


11 

V 6 

Xl 8 


18 

V 7 

X 19 


8 

V 8 

X 2 0 


30 

Total (V) 

Tv 


141 

Grand Total GT = (V+bc) 

= 141+112 = 253 

V = T/V 

= 141/8 = 17.62 

Table 3 : The skeleton of ANOVA for augmented design 1. 

Source of variation 

d.f. 

ss 

MSS 

Among entries (e) 

e -1 =11 

eSS 

eMS 

Among checks (C) 

C -1 =3 

cSS 

cMS 

Among varieties (V) 

V -1 =7 

vSS 

vMS 

C Vs. V 

1 

cvSS 

cvMS 

Error 

C (b- 1 )= 8 

ESS 

EMS 

Total 

N-1 = 19 

TSS 
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Statistical analysis: All the components of ANOVA can now be 
developed as under: 

i) GCF (General correction factor) = (GT) /N 
= (253) 2 /20 = 3200.4 

ii) TSS = V ' 2 "" GCF =Xl2 + + + X * 12 + 


i=l j=l i=l 
Xi 3 2 + — + x 2 o 2 - GCF 

= 10 2 + 8 2 + — + 14 2 + 18 2 + 6 2 + 40 2 + — + 30 2 - 3200.4 
= 4897.0 - 3200.4 = 1696.6 

iii) eSS = t Tc ' 2/3 + I V i- GCF 

i=1 i=1 

= (TC! 2 + Tc 2 2 + — + Tc 4 2 ) / 3+ Vi 2 + — + v 8 2 - GCF 

= 24 2 +14 2 +27 2 +47 2 + q2 +4Q 2 + -+30 2 -3200.4 

3 


= 1236.6 + 3637.0 - 3200.4 = 1673.2 
eMS = eSS/(e-1) = 1673.2/11 = 152.1 

iv) cCF = T c 2 / be = 112 2 / (3 x 4) = 112 2 /12 = 1045.3 

v) cSS = ^Tci 2 /3-cCF 

i 

- 24 2 +14 2 +27 2 +47^ _ 1 Q45 3 = -j 2 36,6 -1045.3 = 191 3 
3 

vi) vCF = Tv 2 /v= 141 2 /8 = 2485.1 

V 

vii) vSS - 7 Vi 2 -vCF =6 2 +40 2 h -h8 2 +30 2 -2485.1 


vMS 


= 3637.0-2485.1 = 1151.9 
= vSS / (v-1) = 1151.9 / 7 = 164.5 
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viii) cvSS = eSS - cSS - vSS = 1673.2 -191.3-1151.9 = 330.0 
cvMS = cvSS/1 = 330.0/1 = 330.0 

ix) ESS = TSS - eSS = 1696.6 - 1673.2 = 23.4 
EMS = ESS / c (b-1) = 23.4/4 (3-1) = 23.4/8 = 2.9 

x) F (calculated) = eMS / EMS (for entries) = 152.1/2.9 = 52.4 
= eMS / EMS (for checks) = 63.7/2.9 = 21.9 

= VMS / EMS (for varieties) = 164.5 / 2.9 = 56.7 
= cvMS / EMS (for checks Vs Var S) = 330.1 / 2.9 = 113.8 

xi) Standard error of mean difference : 

SEd 1 (between 2 checks) = 

2EMS (2x2.9 2.4 

—u— = a —^— =-= 1 -38 

b V 3 1.73 


SEd 2 (between 2 vars.) = V2EMS = ^2x2. 9 = 2.40 
SEd 3 (between a check and variety) = 


1 

fi + l] 

r~ 

full 

Jems 

= 2.9 


l b J 

V 

v 3 J 


= V 2.9x1.33 = V3.86 = 1.96 


CD = corresponding SEd x t (5% or 1%) for error d.f. (=8) 

Presentation of results :The whole ANOVA is presented now as 
under Table 4. 


Table 4 : ANOVA (Augmented Design I) 


Source of variation 

d.f. 

SS 

MSS 

F 

Among entries 

11 

1673.2 

152.1 

52.4** 

Among checks 

3 

191.3 

63.7 

21 . 9 ** 

Among varieties 

7 

1151.9 

164.5 

56.7** 

Checks Vs. varieties 

1 

330.0 

330.0 

113 . 8 ** 

Error 

8 

23.4 

2.9 


Total 

19 

1696.6 



p<0.01 (significant at 1% level of significance) 

With this table of ANOVA, table of means must be presented as 
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shown below (Table 5) 

Table 5 : Mean Performance. 


Checks / 

Var. 

No. of tillers 

per plant 

Vars. 

No. of tiller / plant 

c, 

8.0 

V 5 

11 

c 2 

4.7 

v 6 

18 

C3 

9.0 

v 7 

8 

C 4 

15.7 

Vs. 

30 

C 

9.33 

V 

17.62 

Vi 

6 



V 2 

40 

CD^t 5% 

1.38x2.306 = 3.182 

V 3 

4 

CD 2 at 5% 

2.40 x 2.306 = 5.5.34 

V 4 

24 

CD 3 at 5% 

1.96x2.306 = 4.519 


Interpretation of results 

(i) All the variance among entries / checks / varieties and the single 
degree of freedom comparison (checks Vs. varieties) are highly 
significant (p<0.01) 

(ii) Test variety V 2 and V 8 are significantly highest tillering and V 3 and 
Vi the lowest. 

(iii) Test varieties are significantly superior over checks ( C = 9.33 < 

V= 17.62). 

Augmented design I can be extended to any number of V and C 
on the same line as above. 

Augmented Design II 

Concept: This is a slightly modified version of augmented design 
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I, where instead of repeating all the checks b times throughout the 
field, the whole field is divided distinctly into b-blocks. Then 
randomization is done such that all the checks and a part of test 
genotypes fall only once in each block. Thus the total number of plots 
(N) remains, the same as in design I (if same number of V and C are 
used) but the number of plots in each block is variable. The 
advantage of such a stratification (block formation) of area is to 
reduce the EMS by apportioning the block effect which is confounded 
with error in design I. Actually, what RBD does by replicating only the 
checks (C), thus saving space. 

Field plan and randomization: Thus can be illustrated using the 
example given in augmented design I i.e. V = 8 (N^ to V 8 ), C = 4 (Ct 
to C 4 ) b = 3, e = 12 and N = 20. Now divide the whole experimental 
field into (b=)3 distinct blocks. 

Block 1, consisting of (say) 7 plots (Ci to C 4 + \A V 4 and V 8 ) 

Block 2, comprising (say) 6 plots (C^ to C 4 + V 2 and V 3 ) and 

Block 3, encompassing (say) 7 plots ( C, to C 4 + V 5 , V 6 and V 7 ) 

So as to construct relatively homogenous stratification of area in 
each block. Thus number of test varieties falling in each block are n = 
3, 2 and 3, respectively. Randomize all the entries (check plus 
varieties) into each block as depicted below: 

Field layout: Augment Design II 


Block 1 Block 2 


C 2 

C 4 

< 

00 

Cl 

Vi 

CO 

o 

V 4 

CO 

> 

C 3 

i c 2 

CNi 

> 

c. 

C 4 

V 7 

C 3 

Cl 

V 5 

c 2 

v 6 

C 4 


Block 2 contd... Block 3 


c) Statistical analysis: This can be shown with the same data 
on number of tillers in 8 varieties and 4 checks (replicated 3 times) of 

linseed as in design 1 but average data arranged block-wise as 
under Table 6. 
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Table 6: Number of tillers in 8 varieties and 4 checks of linseed 
grown in augmented design II. 

Checks Symbolic data 


Actual data 



(Ci) Replication (bj) Total 

Mean 

Replication 

Total Mean 

1 

2 

3 (Tc) 

(Ci) 

1 2 3 



Ci xi 

X2 

X3 Tci 

Ci 

10 8 6 

24 

8.0 

C 2 X4 

X5 

X6 TC2 

C 2 

3 5 6 

14 

4.7 

C 3 X7 

X8 

X9 TC3 

C 3 

10 9 8 

27 

9.0 

O 

X 

Tl- 

O 

Xll 

X12 TC4 

C 4 

15 14 18 

47 

15.7 

Total Tcbi 

Tcb 2 

Tcb3 Tc 

c 

38 36 38 

112 

37.3 

(C) 







Test var 

(Vi) Symbolic data 


Actual data 



(unadjusted) 

Replication (bj) 


Replication 




1 

2 3 


1 2 

3 



V 8 

V 3 V 7 


30 4 

8 



V, 

V 2 V 5 


6 40 

11 



V 4 

v 6 


24 

18 


Total (V) 

Tvbl 

Tvb 2 Tvb3 Tv 

60 44 

37 

141 

Total 

Tbi 

Tb2 Tb3 

GT 

98 80 

75 

253 

Table 7: The skeleton of ANOVA for augmented design II 

Source 

of 

d.f. 

ss 

MSS 

F 


variation 







Blocks 


b-1 =2 

bSS 

bMS 

bMS/EMS 

Entries 


e-1 = 11 

eSS 

eMS 

eMS/EMS 

Checks 


c-1 = 3 

cSS 

cMS 

cMS/EMS 

Varieties 


V-1 =7 

vSS 

VMS 

vMS/E MS 

Checks Vs. 


1 

cvSS 

cvMS 

cvMS/EMS 

varieties 







Error 


(c-1) (b-1)=6 

ESS 

EMS 



_Total 


N-1 = 19 

TSS 







Now different components of ANOVA can be calculated as 
follows: 

However, since only the checks (c) but not the test varieties (v) 
are replicated in this design, adjustment of means of v must be done 
before ANOVA with the help of the following effects. 

i) Blocks effects (bj) = 1/C (Tbj - C - T vb |) (j=1 to b) 
b, = % (98 - 37.3 - 60) = Vi (0.7) = 0.17 

b 2 = Vi (80 - 37.3 - 44) = 'A (-1.3) = -0.32 
b 3 = % (75 - 37.3 - 37) = 'A (0.7) = 0.17 

b 

Counter check : ^ bj = 0 

Ignoring rounding errors. In this case (Ibj = 0.17 - 0.32 + 0.17 = 0) 

I) 

ii) Mean effect (m) = 1/e (GT - (b-1) C - «> hj) where, n, 

I 

is the number of V occurring in j block. 

In the present case 

m = 1/12 [(253 - (3-1) 37.3 - (3 x 0.17 + 2 x (-0.32) + 3 x 0.17)] 

= 1/12(253-74.6-0.38] 

= 1/12x178.02 
= 14.8 

If nj is the same in all blocks, then 
m = 1/e (GT - (b - 1) C) 

iii) Check effects (Cj) = C s - m (i = 1 to C) 

Ct = 8.0 - 14.8 = -6.8; C 2 = 4.7 - 14.8 = -10.1 
C 3 = 9.0-14.8 = -5.8; C 4 = 15.7 - 14.8 = 0.9 

iv) Adjusted mean of test variety (Vi) and genotypic effects (Vi”) 
are now computed as under Table 8. 
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Table 8 : Adjusted means and genotypic effects. 


Test 

variety 

Unadjusted 
mean (Vi) 

Adjusted mean 

(Vi' = Vi-bi) 

Genotypic effect 

(Vi" = Vi'-m) 

Vi 

6 

6-0.17 = 5.83 

5.83-14.8 = -8.97 

v 2 

40 

40 + 0.32 = 40.32 

40.32-14.8 = 25.52 

v 3 

4 

4 + 0.32 = 4.32 

4.32-14.8 = -10.48 

V 4 

24 

24 - 0.32 = 23.83 

23.83-14.8 = 9.03 

V 5 

11 

11 -0.17 = 10.83 

10.83-14.8 = -3.97 

V 6 

18 

18-0.17= 17.83 

17.83-14.8 = 3.03 

V 7 

8 

8-0.17 = 7.83 

7.83-14.8 = -6.97 

V 8 

30 

30-0.17 = 29.83 

29.83-14.8 = 15.03 


bj is block effects of block where Vi falls in 

ANOVA is now carried out using the data in tables 8 and 


previous table developed different value as under: 

GCF (General correction factor) GT 2 /N = (253) 2 /20 = 3200.45 

bSS=^Tb i 2 /(c+n j )-GCF = ^l + ^i + ^- - 3200.45 

= 1372 + 1066.66 + 803.57 - 3200.45 
= 3242.23 - 3200.45 = 41.78 
bMS =bSS/(b-1) = 41.78/2 = 20.89 

TSS = ZZ +C ? + Z Vi_GCF 

i=1 j=1 i 

— x 2 ! + x 2 2 + —i- X 12 + v 8 2 + — +Vg 2 — GCF 
= 10 2 +8 2 + 6 2 +—+ 14 2 + 18 2 + 30 2 + 4 2 +—24 2 +18 2 - 3200.45 
= 4897.00 - 3200.45 = 1696.55 
cCF = (check correction factor ) = Tc 2 /bc = 112 2 /12 = 1045.3 

CSS = X T ci /b - cCF = _- 4 . 2 +14 2 +27 2 +47 2 _ 1 ^ 

i ^ 

= 1236.6-1045.3 = 191.3 
cMS = cSS / (c-1) = 191.3/3 = 63.7 


eSS = (m x GT) + 


( b 


X bj ' b > 


Tt 


V. i 


( V 1 l u 

V" 


V 1 


£Tci Ci + v " - £ T »i / ( c+n i) 


V 1 


V 1 


= (14.8 x 253) + (0.17 x 98) + (-0.32 x 80) + (0.17 x 75) + 

[24 x (-6.81)] + [14 x (-10.1)] + (27 x 5.8) + (4.7 x 9) + 

[6 x (-8.97)] + (40 x 25.52) + [4 x (-10.48)] + [24 x 9.03)] + 
[11 x (-3.97)] + (18 x 3.03) + [8 x (-6.97)] + [30 x 15.03)] 


( 


98' 


80' 


+ 


75 2 


A 


4 + 3 4 + 2 4 + 3 


= 1634.8 


v • ■ - ■ ’ - / 

eMS = eSS/(e-1) = 1634.1 /II =148.6 

vCF (variety correction factor) = Tv 2 /V = 141 2 /8 = 2485.1 


v 

vSS = V Vi vCF = 6 2 + 40 2 + .... + 8 2 + 30 2 -; 


2485.1 


= 3637.0-3485.1 =1151.9 

VMS = VSS / (v-1) = 1151.9/7 = 164.5 

cvSS = eSS - cSS - vSS = 1634.8 - 191.3 - 1151.9 = 291.6 

ESS = TSS - eSS - bSS = 1696.55 - 1634.80 - 41.78 = 19.97 

EMS = ESS / (c-1) (b-1) = 19.97/6 = 3.32 

Presentation of results: The ANOVA table is presented as 
under: 


Table 9 : ANOVA 


Source of variation d.f. _SS MSS 

Blocks (b) 


Entries (e) 

Checks (c) 

Varieties (v) 

Checks Vs. varieties 
(cvs.v) 

Error _ 

Total 


2 41.78 

11 1634.80 

3 191.30 

7 1151.90 

1 291.60 


20.89* 6.29 

148.60** 44.75 

63.70** 19.18 

164.50** 49.54 

291.60** 87.88 

3.32 


■ t 


-p<0.05, p<0.01 respectively 


1696.55 
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Mean performance is same as in Table 5 

Inferences: Same as in augmented design I, except that here 
variance due to block effect is significant (P<0.05). Now, following 
four standard errors are helpful in drawing conclusions: 


SEdi (between any two check means) = 


'2EMS 


12x3.32 

3 


= V2.213 =1.48 

SEd 2 (between any two means of test varieties) in the same 
block = V2EMS = 72x3.32 = 2.57 

SEd 3 (between any two test varieties not in the same block) = 

i 2 K i+ ^W 2xa32x K) 

= 76.64x1.25 = 7^3 = 2.88 
SEd 4 (between mean of a check and a test variety) 

= /EMsfl + - + - + — ) = 73.32x1.66 =2.34 
\ { b c be J 

CD is than calculated using these SEd x t at 5% or 1% levels (of 
d.f. = 6) as the case may be : 

Table 10: ANOVA of augmented design II for large collections 

(N = 1300, b=10, C=10, V=1200, e=1210) 


Source of variation d.f. MSS 


Block (b) 

Entries (e) 
Checks (c) 

Test varieties (v) 

c Vs. v 

Error 


b-1 =9 
e - 1 = 1209 
c-1 =9 
v-1 =1199 


(c -1) (b -1) = 81 


(N-1) = 1299 
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Extension of augmented design II for large collections 

Field designs layout and ANOVA etc. all will follow the same 
pattern as elaborated above. For a testing of large collections (e.g. 
1200 genetic stocks + 10 checks replicated 10 times) the following 
ANOVA is exemplified (Table 10). 

General consideration: Some general points in connection with 
greater utility of augmented design II are: 

1. The number of test genotypes may be kept the same in each 
block. This will greatly facilitate statistical analysis. 

2. A block may continue in next (contiguous) strip also. However, as 
far as possible, plots in each block may be kept homogeneous 
for soil factors. Homogeneity of soil in a block can thus be 
maintained by increasing the number of blocks (adequate 
stratification of experimental area) and by reducing the plot size. 

3. Augmented design II serves as the best design for initial 
screening of large collection generally unmanageable by any 
other design. The choicest genotypes are reevaluated in RBD for 
intensive investigation. 

4. While determining genetic divergence among extensively large 
number of genotypes through multivariate analysis, this design 
can help both evolutionary geneticists and plant breeders alike. 

Thus, augmented designs are unique in that they are employed 
in situation where no other design can logically be adopted. 

Augmented BIB Design 

Consider any BIB design with parameters v, b, r, k and X and 

augment every block by adding s-th (s=1, 2.,t) supplementary 

treatment to p s plots and call the resultant design as D*. It will have 
(v+t) treatments and every block will have k* = k + ^p s = k + k 0 

s 

plot. 

Example: The layout plan of a BIB augmented design is given 
along with the yield per plot. Analyse data. 



Yield in kg/plot 


(3) 25.3 

(6) 19.9 

(9) 29.0 

(11)24.6 

(14) 205 

13) 23.0 

(4) 19.8 

(8) 33.3 

(12) 22.7 

(14) 18.6 

|^|^i 

RBEEl 





i mrm 

nuESSii 

IBRSI 


_ 1 - - - 

(7) 23.4 

(8) 30.5 

(9) 30.8 

(10)32.4 

(14) 22.0 

* -L --- 

(4) 30.6 

(5) 32.4 

(6) 27.2 

(10) 32.8 

(14) 19.3 

(1)34.7 

(5)31.1 

(9) 25.7 

(12) 30.5 

(14) 24.1 

_ 1 — L - 

(3) 34.4 

(5) 32.4 

(7) 33.3 

(13) 36.9 

(14) 17.5 

(1)38.2 

(2) 32.9 

(3) 37.3 

(10) 31.3 

(14) 21.2 

(2) 28.7 

(4) 30.7 

(9) 26.9 

(13) 35.3 

(14) 23.5 

(1) 36.6 

(4) 31.1 

(7) 31.1 

(11) 28.4 

(14)21.3 

(1)31.8 

(6) 33.7 

(8) 27.8 

(13) 41.1 

(14) 20.7 

(2) 30.3 

(6) 31.5 

(7) 39.3 

(12) 26.7 

(14) 19.8 


The figures in bracket indicate treatment numbers. 


Calculations 

i) Here v* = v + t = 14, b = 13, r = 4 for basic treatments and 13 
for supplementary treatment and k* = k + 1 = 5. 

ii) Computation of Qj’s 


Table 


Treatment 

number 

Treatment 
total (TJ 

Block 
numbers 
in which i- 
th 

treatment 

occurs 

Sum of all 
the blocks 
containing 
i-th 

treatment 

(SBj) 

0) 

o^Ti-^tyk* 

(') 

1 

141.3 

7, 9, 11, 12 

610.5 

19.20 

2 

119.2 

4, 9, 10, 13 

579.6 

3.28 

3 

120.2 

1,2, 8,9 

552.1 

9.58 

4 

112.2 

2, 6, 10, 11 

553.2 

1.56 

5 

122.9 

4, 6, 7, 8 

568.9 

9.12 

6 

112.3 

1,6, 12, 13 

564.3 

-0.56 

7 

127.1 


589.6 

9.18 

8 

127.2 

BHdKM 

537.6 

19.68 

9 

112.4 

1,5, 7, 10 

549.5 

2.50 
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Treatment 

number 

T reatment 
total (Tj) 

Block 
numbers 
in which i- 
th 

treatment 

occurs 

Sum of all 
the blocks 
containing 
i-th 

treatment 

(IBj) 

0) 

O i =T,-^B j /k* 

(0 

10 

112.7 

3, 5, 6, 9 

556.8 

1.34 

11 

89.7 

1,3, 4, 11 

508.2 

-11.94 

12 

111.6 

2, 3, 7, 13 

525.2 

6.48 

13 

139.9 

kllWH 

569.2 

26.06 

14 

267.8 

1 to 13 

1816.3 

-95.46 


^ Of = 190045 , 


r-A 3 
rk * " 20 


Treatment S.S. (adjusted) 


= 1 Tof- 

r(1-M*) i ( 1 -p*)rv “ bp s 


1229.00 , 


Here Q. = 1°. 

i=1 

Total S.S. = 2475.28 

Block S.S. (unadjusted) = 571.43 

Q, s denote the adjusted total of s-th (s=1, 2.t) supplementary 

treatment. 


iii) Analysis of variance 

ANOVA Table 


Source 

d.f. 

S.S. 

M.S. 

F 

Blocks 

(unadjusted) 

12 

571.43 

47.62 


Treatments 

(adjusted) 

13 


94.54 


Error 

39 



BWHI 

_Total 

64 


-- 

-- J 


* indicates significant at 1 per cent level 


























It can be inferred that the treatment effects significantly 
iv) Variances of elementary contrasts 
Variance of elementary contrasts are 


V 




2s; 


r(l - ju*) 
treatments of basic set 


= 10.18 kg 2 when i and j are 


1 


r(1-M*) 


i .i 


U* 't 

1 ~— + , 
r(v) J bp, j 


2 

= 6.36 kg for comparison between basic and supplementary 

treatments. 


□ □ □ 
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Block Designs 



Block designs are widely used in many fields of research. Their 
most common type is the randomized block design. It has many 
advantages over other designs, but it is not applicable in all 
circumstances. If the number of treatments is too large to preserve 
homogeneous conditions within complete block or if the size of block 
is determined by the nature of experimental material or technique, we 
have to resort to incomplete block design where every treatment is 
not included in a given block. 


Balanced Incomplete Block Designs (BIBD) 

The most important class of incomplete block designs is the 
balanced incomplete block (BIB) design introduced by Yates (1936). 
A BIB design is an arrangement of v treatments into b blocks each of 
size k(<v) treatments, satisfying the followings conditions. 

(i) Every treatment occurs at most once in each block. 

(ii) Every treatment occurs in exactly r blocks. 

(iii) Every pair of treatments occurs together in exactly X blocks. 

These designs have the characteristic that all the contrasts are 
confounded with the blocks to the same extent and every elementary 
contrast is estimated with same variance. 
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Partially Balanced Incomplete Block Design (PBIBD) 

The restriction that each pair of treatments should occur exactly 
X times in the design is demanding and unfortunately, the BIB 
designs do not exist for all meaningful parametric combinations. 
Moreover, these designs require a large number of replicates for 
given v and k. This led to the investigation of other types of 
incomplete block designs in which the restriction of every contras! 
being confounded with blocks to same extent is relaxed. The design 
where some of the treatment contrasts are more confounded than 
others are partially balanced incomplete block (PBIB) designs 
introduced by Bose and Nair (1939). For PBIB designs with m- 
associate classes, Var (a r aj) may take m different values depending 
upon whether i th and j th treatments are first, second or m th associates. 
Bose and Shimamoto (1952) introduced the concept of partially 
balanced association scheme and the definition of PBIB designs was 
rephrased in this context. The definition of m class-association 
scheme and the PBIB design is given below: 

Given v treatments, 1, 2, ., v, a relation satisfying the 

following conditions is said to be an association scheme with m 
classes: 

(i) Any two treatments are either 1 st , 2 nd ,., or m-th associates, 

the relation of association being symmetrical; that is, if the 
treatment 0 is i-th associate of <(), then <|) is also i-th associate of 0. 

(ii) Each treatment 0 has nj i-th associates, the number nj being 
independent of the particular treatment 0 chosen. 

(iii) If any two treatments 0 and are i-th associates, then the 
number of treatments that are j-th associates of 0, and k-th 
associates of <j> is p' jk and is independent of the pair of i-th 
associate 0 and <\>. 

The numbers v, n ( (i=1, 2. m) and (i, j, k=1, 2.m) 

are called the parameters of association scheme. 

Given an m-class association scheme, we have a PBIB design 
with m classes, based on the scheme; if it is possible to arrange the v 
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treatments into b blocks such that. 

(i) Each block contains k distinct treatments. 

(ii) Each treatment occurs in r blocks. 

(iii) Any two treatments which are i-th associates occur together in A, 

blocks, (i=1, 2.m), where A, is constant independent of the 

pair of the i-th associates chosen. 

The integers v, b, r, k, \ (i= 1, 2, .. m) are called the 

parameters of the PBIB design. 

From practical view point, the two associates class PBIB designs 
are important and an exhaustive list of these design was prepared by 
Bose, Clatworthy and Shrikhande (1954). The list was subsequently 
revised by Clatworthy (1956, 1973). Bose and Shimamoto (1952) 
have classified the two associates class PBIB designs in to five 
following categories depending upon the association scheme on 
which they are based. 

i) Group divisible (GD) 

ii) Simple (SL) 

iii) Triangular (T 2 ) 

iv) Latin square type (Lj) 

(v) Cyclic (C) 

Partially Efficiency Balanced (PEB) Design 

In BIB and PBIB designs, the restriction of equal block size and 
equal number of replications for each treatment may be a serious 
practical obstacle in many possible experimental circumstances in 
biological research. With biological material there are natural units 
mat can be used as block and these blocks contain varying number 
of units which are not under the control of experimenter. This may be 
clear from the following example. 

(ii In a nutritional experiment with pigs, the number of young ones in 
the available liters may be unequal, being in some cases equal to 
the number of treatments but in others smaller or larger. 
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(ii) A psychologist have to use families as blocks, but, the numbers 
of family members may very from family to family. 

(iii) The insecticidal sprays, each of two concentrations, are to be 
compared on different batches of insects. The conditions allow 
only 4 batches to be sprayed on one day, and the experimenter 
wants to have 2 replications for each of the 6 treatments and as 
many as 5 for the control. 

(iv) Many new varieties or strains of a crop together with one or two 
standard varieties are to be compared in one field experiment. 
The number of plots for the new varieties is usually severely 
limited by available amount of seed, but no such limitation is 
there for standard varieties. One could replicate standard 
varieties more than newly evolved varieties. 

A further inconvenience in the use of BIB and PBIBD is that 
these are not available for every, meaningful parametric 
combinations. For example, no two associate PBIB design is 
available in 2 or 3 replications for 18 varieties. Moreover, the designs 
may not be available for the number of treatments in which the 
experimenter is actually interested or the available designs may use 
a large number of replication which the experimenter may not be able 
to afford. 

A partial answer to this problem is provided by the work of Puri 
and Nigam (1977). They introduced Partially Efficiency Balanced 
Designs which are available in unequal number of replications and 
block sizes. 

A block Design D (v, b, r, k) with v treatments arranged in b 
blocks of sizes k 1f k 2 .... k b such that i-th treatment be replicated r { 
times (i=1, 2...v) is said to be a partially efficiency balanced (PEB) 
design with m-efficiency classes if a complete set of (v-1) basic 
contrasts can be partitioned in to m-disjoint classes such that 
efficiency factor associated with every contrast of i-th class is Ej = (1- 
Pi), where pj (i=1, 2...m) is the eigen value of M 0 matrix with 

m 

multiplicity Pi such that £p i= v-1. For PEB design Mo will have 
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spectral decomposition. 

m 

M„ = Lj where Lj are mutually orthogonal idempotent 

i=1 

m 

matrices satisfying ^Lj =| v 

i=1 

The parameters of PEB design with m efficiency classes can be 
written as v, b, r, k, jLLj, p„ Lj, i=1,2,...m. 

Here 

N : (njj), a v x b incidence matrix of treatment in to blocks associated 
with every block design D, n^ being the number of times the i- 
th treatment occurs in the j-th block. 

Let r = [r 1f r 2) .r v ] T is a v x 1 vector of replication with i-th element 

as r-„ the replication of i-th treatment, i=1, 2.v. 

k = [k 1t k 2 ,.k b ] T is a b x 1 vector of block sizes with j-th element 

as kj, size of j-th block, j = 1,2.b 

R : v x v diagonal matrix with diagonal elements as r 

K : b x b diagonal matrix with diagonal elements as k 

R' 1 inverse R 

K* 1 : Inverse of K 

n : Total numbers of observation in the design. 

T: Stands for Transpose sign. 

Ql =B-NK' 1 N t 
Mo = R' 1 N K' 1 N t -1 r T /n 

A particular class of PEB designs where jij takes only two values 
p and 0 with multiplicities p and (v-p-1) is termed as simple PEB 
design. The parameters of simple PEB designs can be written as. 

v, b, r, k, pi, p, L 

If jLLj = jlx for all i, the design is called efficiency balanced (EB) [Puri 
and Nigam (1975) & William (1975)]. 
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Optimality of Designs 

In the early stags of agricultural experimentation, only a few 
experimental designs were at one's disposal. But now-a-days an 
experimenter can use a variety of experimental designs. Hence 
whenever the conditions of an experiment allow the possibility of 
simultaneous existence of a number of designs, the questions of 
selection of an appropriate design, a design which is easy to analyse 
and which has some optimum properties, naturally arises. A 
systematic study of the specification of optimum experimental 
designs was under taken by Kiefer (1958, 1959) where he introduced 
the various optimality criteria (A, D, E, L, M) and discussed 
interrelations amongst these and established the optimality property 
of some well known designs. 

A-optimum Designs 

One of the objectives of designing experiments is to ensure 
enhanced accuracy and reliability of conclusions and inferences. 
Such accuracy depends on variances of estimates of treatment 
contrasts or comparisons, the less such variance the more is the 
accuracy. Designs can be compared by using suitable functions of 
variances of estimates of all normalized independent (orthogonal) 
contrasts of treatment effects. One such function is the mean of 
reciprocals of variances of estimates of each of the normalized 
contrasts of treatments when the contrasts are mutually orthogonal. 

A design where such a mean is maximum among all the competing 
designs is called an A-optimum design. Mathematically mean of the 
reciprocals of the Eigen value of C matrix of a design as a criterion 
forjudging a design as A-optimal. 

D-optimum Designs 

In A-optimum designs discussed above, we have used the mean 
of the reciprocals of the Eigen value of C Matrix of a design as a 
criterion for judging a design. In D-optimum designs, the criterion 
used is the product of the Eigen values and this should be minimum 
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for a D-optimum design over similar competing designs. It is not 
possible to indicate what criterion should be used i.e. whether D- 
optimum design or A-optimum design for discriminating designs. For 
any pronouncement in this regard, the situation and objectives of the 
experiments have to be taken into account. 

E-optimum design 

Again, there is one more of type of optimum design known as E- 
optimum designs. A design is said to be E-optimum if the maximum 
Eigen value of C Matrix is less than the maximum Eigen value of C 
Matrix of any other design. 

This way, there are several other criterion for optimization and 
considerable research activities are available in literature in regard to 
such optimum designs. However, no systematic tabulation of such 
designs exists in literature. For the definition of L and M optimality 
criteria refer to Kiefer (1958). 

There can be another type of optimum designs. The designs 
discussed above are assessed by taking into account, total 
information that a design can provide (emanating from all orthogonal 
contrasts). But there are situations where experimenters are 
interested mainly in certain specific contrasts. If there can be designs 
which allow estimation of such contrasts with full precision such 
designs are also optimum though in a limited but more meaningful 
manner. 

Analysis of Two Associates Partially Balance Incomplete 
Block Design Analysis of some important types of two associate 
PBIB designs are given. The procedure of analysis is very much 
simple and straight forward. 

Group divisible (GD) design 

A PBIB with two associate class is said to be GD it there are 
v=mn treatments and the treatments can be arranged in m groups of 
n treatment each, such that any two treatment of the same group are 
first associate and any two treatments from different groups are 
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second associates. The parameters of GD design are v = mn, b, r, k 
Xi and X 2> = (n-1), n 2 = n (m-1) 

Example: The following gives the plan and yield in kg/plot of 15 
varieties of wheat in yield trial. The figures in bracket indicate the 
variety number. 

Yield in kg/plot 


(15) 2.4 

(9) 2.5 

(1)2.6 

(13) 2.0 

(5) 2.7 

(7) 2.8 

(8) 2.4 

(1)2.7 

(10) 2.6 

(1)2.8 

(14) 2.4 

(2) 2.2 

(15) 3.4 

(11)3.1 

(2) 2.1 

(3) 2.3 

(6) 4.1 

(15) 3.3 

(4) 3.3 

(7) 2.9 

(12) 3.4 

(4) 3.2 

(3) 2.8 

(1)3.0 

(12) 3.2 

(14) 2.5 

(15) 2.4 

(8) 2.6 

(6) 2.3 

(3) 2.3 

(14) 2.4 

(5) 2.7 

(5) 2.8 

(4) 2.8 

(2) 2.6 

(13) 2.5 

(10) 2.5 

(12) 2.7 

(13) 2.8 

(6) 2.6 

(9) 2.6 

(7) 2.6 

(10) 2.3 

(3) 2.4 

(8) 2.7 

(6) 2.7 

(2) 2.5 

(9) 2.6 

(5) 3.0 

(9) 3.6 

(11) 3.2 

(12) 3.2 

(7) 3.0 

(13) 2.8 

(14) 2.4 

(11)2.5 

(10) 2.4 

(4) 2.5 

(8) 3.2 

(11)3.1 


Analyse the data and draw conclusions. 

Calculations 

(i) Here v = 5x3, b= 15, r = k = 4 Xi = 0, X 2 =1 and also m = 5, n = 3, 
r\i = 2, n 2 = 12. 

The key to the design is the following association scheme among 
the treatment number. 

1 6 

Groups 2 7 

3 8 


11 

12 

13 



Treatment 
number (s,j) 

T reatment 
total T sj 

Blocks in 
which (s,j) th 
treatment 

occurs 

S si 

Qsj-Tsj- 

S S j/k 

(i.i) 

11.1 

1,2, 3,6 

42.5 

0.475 

(2,1) 

9.4 

3,4, 9,12 

42.1 

-1.125 

(3,1) 

9.8 

4, 6, 8,11 

42.9 

-0.925 

(4,1) 

11.8 

5, 6, 9, 15 

47.9 

-0.175 

(5,1) 

11.2 

2, 8, 9,13 

44.0 

0.200 

(1,2) 

11.7 

5, 8, 10, 12 

44.4 

0.600 

(2,2) 

11.3 

2, 5, 11, 14 

44.8 

0.100 

(3,2) 

10.9 

2, 7, 12, 15 

43.0 

0.150 

(4,2) 

11.3 

1, 11, 12, 13 

42.9 

0.575 

(5,2) 

9.8 

3, 10, 11, 15 

41.7 

-0.625 

(1,3) 

11.9 

4, 13, 14, 15 

45.8 

0.450 

(2,3) 

12.5 

6,7, 10, 13 

46.7 

0.825 

(3,3) 

10.1 

1, 9, 10, 14 

41.5 

-0.275 

(4,3) 

9.7 

3, 7, 8,14 

41.1 

-0.575 

(5,3) 

11.5 

1,4, 5,7 

44.7 

0.325 


Here T sj denote the total of all the plots receiving (s,j) treatment. 
S sj denote that total value of all those blocks which receive the (s,j) 
treatment. 


We now arrange the treatments group wise and propose the 
following two way Table. 


Grout 


0.475 

-1.125 

-0.925 

-0.175 

0.200 


0.600 

0.100 

0.150 

0.575 

-0.625 


0.450 
0.825 
-0.275 
-0.575 
n 39R 


1.252 I 

-0.200 i 

-1.050 

-0.175 
_n inn 
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II Q 2 S j = 4.5262 IQ 2 s = 3.5087 

S j S 


We compute the following 


Mi = 


r — A 1 _ 1 
k ~ 4 


M 2 = 


rk-vA 2 

rk~ 


16 


Treatment SS (adjusted) = 

f o \ 


1 




IIQ sj -I 

S j s 


91 s 

n 


1 


Crs 


r( 1 -M 2 )s n 


= 1.5640 


Block S.S. unadjusted = 4.9233 Total S.S. = 9.1733 


ANOVA 


Source 

D.F. 

S.S. 

M.S. 

F 

Block (unadjusted) 

14 

4.9233 



T reatment 

14 

1.5640 

0.1117 

1.289 

(adjusted) 





Error 

31 

2.6860 

0.0866 


Total 

59 

9.1733 




From the above table it is inferred that the effects of varieties do 
not differ significantly. Variance of elementary contrasts can be 
worked out as: 


V (ts-tj) = 


2s 2 


= 0.0577 when s and j are first associate 


2s 2 

r(i - Mi) 


fi+ 

l n(1-p 2 ) 


J 


= 0.0539 otherwise 


L 2 PBIB Design 

In L 2 type PBIB designs, there are v 2 treatment arranged in a v x 
v square so that any two treatments are first associate if they occur in 
same row or same column and are second associate otherwise. The 
parameters of L 2 PBIB design are v 2 , b, r, k, A. 1f A 2 . m = 2 (v-1) and n 2 
= (v-1) 2 . 
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Example 


The layout plan of PBIB design 6 blocks of size 3 is given along 
with the yield/plot. Analyse the data and draw conclusions. 


Yield in kg/plot 


(1)31 

(2) 28 

(3) 25 

(4) 61 

(5)61 

(6)54 

(7) 46 

(8) 56 

(9) 59 

(1)48 

(4) 61 

(7) 42 

(2)35 

(5) 61 

(8) 59 

(3) 53 

(6) 62 

(9)46 

Calculation 

Here v 2 = 9, 

b—6, r — 2, k — 3, — 1 f A ,2 = 

o, ^ = 4, n 2 = 4 


The key to the design is the following association scheme among 
the treatment number. 

1 2 3 

4 5 6 

7 8 9 


Computation of Qjj’s 


Treatment 
number (s,j) 

Treatment 
total T S j 

Blocks in 
which (s,j) th 
treatment 

occurs 

S si 

Qsi-T S j- 

Ssj/k 

(i.i) 

79 

1,4 

235 

0.67 

(1,2) 

63 

1,5 

239 

-16.67 

(1.3) 

84 

1,6 

251 

0.33 

(2,1) 

122 

2,4 

327 

13.00 

(2,2) 

122 

2,5 

331 

11.67 

(2,3) 

116 

2,6 

343 

1.67 

(3,1) 

88 

3,4 

312 

-16.00 

(3,2) 

115 

3, 5 

316 

9.67 

(3,3) 

105 

3, 6 

328 

-4.33 




Z4U -- - - 

We now arrange the treatments in v x 
following two-way Table. 

v square and prepare the 

Group 

1 

2 

3 

Qi. 

1 

0.67 

-16.67 

0.33 

-15.67 

2 

13.00 

11.67 

1.67 

26.34 

3 

•16.00 

9.67 

-4.33 

-10.66 

Qi_ 

-2.33 

4.67 

-2.33 



LI Q ij = 954.6823 

i j 


LQ 2 i.= 1052.9801 
i 


LQ 2 .j = 32.6667 

j 

The following can be easily computed 
,, (r-Ai) + (v-1) (A^Ag) 

Mi ■ it 

= (r-2A,+M = o 


2 

2 


Treatment SS (adjusted) 


IQ 2 i.-IQ 2 )^ 


1 


rv (1 — Mi) (,T i ') r (1-Mz) 

ZZQ 2 ii - -ZQ 2 i. - -IQ 2 i 

i j v j v j J 


= 658.2823 


Block S.S. (unadjusted) = 1820.67 


Total S.S. 


= 2716.00 

ANOVA 



Source 

D.F. 

S.S. 

M.S. 

F 

Block (unadjusted) 

5 

1820.67 

364.13 


T reatment 

8 

658.28 

82.29 

1.39 

(adjusted) 

Error 

4 

237.05 

29.26 


Total 

17 

2716.00 
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It is concluded that the treatment effect do not differ significantly. 


^ A 


v(tj ~tj) = - 


rv 


1 + ———if i and j are first 


(1 -Mi) O-M2) 




associate =79.01 


2 _ 

rv 


( 


+ ———\g otherwise = 98.77 


v 


( 1 -Mi) (1-M2) 


J 


Augmented PBIB Designs 

Consider any m-associate PBIB design, D, with parameters v, b, 


r, k and V (i=1» 2, ..., m). Suppose we add t supplementary 

treatments by adding s-th (s=1, 2, ..... t) supplementary treatment to 

f V 

p s plots ^P s =k 0 of every block and call the resultant design as 

VS J 


D*. It is clear the k*, the block size of the new design, will be k+k 0 and 
v*, the number of treatments in the new design, will be v+t. 

Augmented GD Design: 

Consider a GD design with parameters v=mn, b, r, k, ^ and X 2 
and obtain D* by augmenting t new treatments as explained above. 
The number of treatments and the block size of D* will be mn+t and 
k+k 0 respectively. 

Example: Following table gives the plan and yield in kg. per plot 
of 16 varieties of Wheat in a particular yield trial. The figures in 
bracket indicate variety number. 


Yield in kg./plot 


(15) 2.4 

(9) 2.5 

(1) 2.6 

(13) 2.0 

(16) 2.3 

(5) 2.7 

(7) 2.8 

(8) 2.4 

(1) 2.7 

(16) 2.4 

(10) 2.6 

(1)2.8 

(14) 2.4 

(2) 2.2 

(16) 2.7 

(15) 3.4 

(11)3.1 

(2) 2.1 

(3) 2.3 

(16) 2.9 

(6) 4.1 

(15) 3.3 

(4) 3.3 

(7) 2.9 

(16) 3.1 

(12) 3.4 

(4) 3.2 

(3) 2.8 

(1) 3.0 

(16) 2.7 

(12) 3.2 

(14) 2.5 

(15) 2.4 

(8) 2.6 

(16) 2.4 

(6) 2.3 

(3) 2.3 

(14) 2.4 

(5) 2.7 

(16) 2.5 
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(5) 2.8 

(4) 2.8 

(2) 2.6 

(13) 2.5 

(16) 2.1) 

(10) 2.5 

(12) 2.7 

(13)2.8 - 

(6) 2.6 

(16)2.9 

(9) 2.6 

(7) 2.6 

(10)2.3 

(3) 2.4 


(8) 2.7 

(6) 2.7 

(2)2.5 

(9) 2.6 

(16) 2.1 

(5) 2.7 

(9) 3.6 

(11)3.2 

. (12) 3.2 

(16) 2.6 

(7) 3.0 

(13) 2.8 

(14) 2.4 

(11)2.5 

(16) 2.3 

■HEjEEBI 

W 2.5_ 

(8) 3.2 

(11)3.1 

_(M_ 


Analyse th^f data and draw conclusion 


Calculations 

(i) Here v* = v + t = 16, b = 15, r = 4 k* = k+k 0 = 5 

If Qjj are the adjusted totals of (i, j) treatment of a GD association 
schemes, then we have 

(ii) Calculation of Qj/s 


Computation of Q,j 


Treatment 

number 

(i, i) 

Treatment 

tptal 

(Tij) 

Block 

numbers in 
which (i, j) 
treatments 

occur 

Sum 
blocks 
where (i, 
treatments 
occur (Su) 

of 

j) 

Qij = Tjj 
-Sj/k* 

(1.1) 

11.1 

1,2, 3,6 

52.6 


0.58 

(2,1) 

9.4 

3, 4, 9, 12 

52.6 


-1.12 

(3,1) 

9.8 

4, 6, 8, 11 

54.0 


-1.00 

(4,1) 

11.8 

5, 6, 9, 15 

58.8 


0.04 

(5,1) 

11.2 

2, 8, 9, 13 

53.7 


0.46 

(1.2) 

11.7 

5, 8. 10, 12 

55.0 


0.70 

(2, 2) 

11.3 

2, 5, 11, 14 

55.9 


0.12 

(3, 2) 

10.9 

2, 7, 12, 15 

52.2 


0.46 

(4, 2) 

11.3 

1, 11, 12, 13 

52.3 


0.84 

(5, 2) 

9.8. 

3, 10, 11, 15 

526 


-0.72 

(1,3) 

11.9 

4, 13, 14, 15 

55.6 


0.78 

(2, 3) 

12.5 

6, 7, 10, 13 

56.7 


1.16 

(3, 3) 

10.1 

1, 9, 10, 14 

52.1 


-0.32 

(4, 3) 

9 ; 7 

3, 7, 8, 14 

51.3 


-0.56 

(5, 3) 

11.5 

1.4, 5,7 

55.4 


0.42 

16* 

38.7 

1 to 15 

202.7 


-1.84 


* Indicates supplementary treatment 



Block Designs 


We arrange the treatments groupwise and prepare the following 
two-way table: 

Values of Qy 

Group 

1 

2 

3 

Qi. 

1 

0.58 

0.70 

0.78 

2.06 

2 

-1.12 

0.12 

1.16 

0.16 

3 

-1.00 

0.46 

-0.32 

-0.86 

4 

0.04 

0.84 

-0.56 

0.32 

5 

0.46 

-0.72 

0.42 

0.16 

Q.. = 

1.84, 





XlQii = 72904 

i j 


VQp = 5.1368 

i 


The following computations can now easily be performed 


A,=- 


.1 


SI Q t-^Z Q '| =1 - 3945 

• i ' y 


a 2 =- 


2 

rn 


Q 2 ] 


" m 


= 0.6771 


U 


* vr 1 . (r-X,-,) 1 

1 bk * 20 * ' rk 16 

* vr 1 , (rk-vX 2 ) 1 

2 bk * H2 5 H2 rk 4 


Treatment S.S. (adjusted) = 


Q‘ 


rv 




9k 

bps 


= 2.5964 

Total S.S. =10.97, 

Block S.S. (unadjusted) = 4.39 
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iii) Analysis of variance 


ANOVA Table 


Source 

d.f. 

S.S. 

M.S. 

F 

Blocks (unadjusted) 

14 

4.39 

0.3135 


Treatments 

15 

2.60 

0.1733 

2.053** 

(adjusted) 

Error 

Total 

45 

74 

3.98 

10.97 

0.0844 



** indicates significance at 5 per cent level 


iv) Variance of elementary contrasts 

Variance of elementary contrasts are 

V ft -t )=—-—So = 0.0444 kg 2 when i and j are first 
v ' r(1 - p*) e 

associates 


1 


r(1-Mi) 

second associates 


1 n-1 

+ 


1 + 


M2 - Mi 1 


n (i - m 2 )Ij 


s 2 = 0.0472 kg 2 when i and j are 


m-1 


1 


^ I 1 , 2 A AAA^ 


0 


s* =0.0331 kg*. 


vr m(1-|i 1 *) vr( 1 -jH 2 ) b Pj 

Where one is supplementary and the other is basic treatment 


Augmented L 2 PBIB design: 

Consider an L 2 PBIB design with parameters v 2 , b, r, k ^ and 

augment every block by adding s-th ( s=1, 2. t) 

supplementary treatment to (3 S plots, and call the resultant design ad 
D*. It is clear that k*, the block size of the new design, will be k+k 0 
and v*, the number of treatments in the new design, will be v 2 +t. 

Example: The layout of PBIB (augmented) design is given 

alongwith the yield in kg. per plot. The figures in bracket indicate 
treatment numbers. 



arnck Designs 
Yield in k g ./plot 
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(1)31 


(3) 28 

(3) 25 

(10) 24 

(4)61 


(5) 61 

(6) 54 

(10) 30 

(7)46 


(8)56 

(9) 59 

(10)28 

(1)48 


(4) 61 

(7) 42 

(10) 29 

(2) 35 


(5)61 

(8) 59 

(10)32 

(3) 59 


(6)62 

(9)46 

(10)39 

Analyse the data and draw conclusions. 



Calculations: 





i) Here v* 

+ t = 10, 

b = 6, r = 2, k* = 

k + k 0 = 4 


*-1 = 1, 

o 

n 





li) If Qjj is 

the adjusted total of (i, j) treatment of an L 2 

association 

scheme, then we have the following computations: 


Table: Computation of Qjj 




Treatment 

T reatment 

Block 

Sum of 

O 

ii 

_-i 

i 

number 

total 


numbers in 

blocks 

Si/k* 

<u> 

(Ty) 


which (i, j) 

where (i, j) 





treatments 

treatments 





occur 

occur (Sh) 


(1.1) 

79 


1,4 

288 

7.00 

(1.2) 

63 


1,5 

295 

-10.75 

(1.3) 

84 


1,6 

314 

5.50 

(2,1) 

122 


2,4 

386 

25.50 

(2, 2) 

122 


2,5 

396 

23.75 

(2, 3) 

116 


2,6 

412 

13.00 

(3,1) 

88 


3,4 

369 

-4.25 

(3, 2) 

115 


3, 5 

376 

21.00 

(3,3) 

105 


3,6 

395 

6.25 

10’ 

182 

1,2, 3, 4, 5,6 

1076 

-87.00 


Indicates supplementary treatment 
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jjj) Arrange the 9 treatments in the form of an l_ 2 association scheme 
and pr epare the following table: __ 

i j 



1 

2 

3 

Qi 

1 

7.00 

-10.75 

5.50 

1.75 

2 

25.50 

23.75 

13.00 

62.25 

3 

-4.25 

21.00 

6.25 

23.00 

Q.i 

28.25 

34.00 

24.75 

87.00 

M 

M 

o 

*=: ro 

= 2076.25, Yq* = 

4407.12, 


M - 

O _.l 

—. no 

II 

: 2566.62 

i 

, Q..= 87.00 




i 


We then compute the following 

=§ where n, = [(r-X,) + (v-1)(X, -\ 2 )]/rk = ^ 
bk 8 ^ 


p 2 = 


rv 


vk 


p 2 =0 


where p 2 = 


[(r-2^ + X 2 )]/rk = 0 


Ai =- 


2 

rv 


2Q 




2.1 


= 321.29 


'J 


A, =- 


2.1 


i j ' j 


= 296.34 


'j 


Treatment S.S. (adjusted) 

>2 2 


Q‘ 


V Al + V -^i s 

tr i - Mi v b P 


rv 

= 2492.40 


Total S.S. =4514.00 

Block S.S. (unadjusted) = 1656.00 

iv) Analysis of variance 



Block Designs 

ANOVA Table 
Source _d.f. 

Blocks (unadjusted) 5 

T reatments 9 

(adjusted) 

Error 9 

Total_ 23 


S.S. 
1656.00 
1492.40 

365.60 

4514.00 


M.S. 

331.20 
276.93 

40.62 


F_ 

6.82** 


★ ★ 


indicates significance at 1 per cent level 

From the above table it can be inferred that the treatment effects 
differ significantly. 

v) Variance of elementary contrasts 

Variance of elementary contrasts are 


J rv 


+ 


v-1,1 


1-Hl 1-M; 


U 


= 48.74 kg 2 when i and j are first associates 


2 _ 

rv 


+ 


v - 2 


.1 


U 


s 2 = 56.87 kg' 


1 -Mi I-M 2 

when i and j are second associates 


bp. 


1 1 

+-- + 


rv 


(v-1) 2 1 2(v-1) , 1 
1-Hl" 1-t*2 




= 32.50 kg 2 

one is supplementary and the other is basic treatment. 


□ □ □ 




aa 

ANALYSIS OF DIALLEL CROSSES 

(GRIFFING’S APPROACH) 


The name diallel which denotes all possible crosses among a group 
of inbred lines, was introduced by Schmidt (1919). For a set of p inbred 

lines, we can have p * 1 2 3 combinations which can be divided into three groups: 

« 

1. The ‘p’ parental lines themselves. 

2. one set of p(p-1)/2 F^sand 

3. the set of p(p-1 )/2 reciprocal F 1 's 

Griffing (1956) gave four methods depending upon the material involved. 

Method 1 : Includes parents, one set of F 1 and reciprocals 

Method 2 : Includes one set of F^s and parents 

Method 3 : includes one set of F-,’s and reciprocal 

Method 4 : includes one set of F^s only. 

The analysis of diallel crosses is a combining ability analysis which 
enables the experimenter into splitting the cross effect into 'gca' effects, 
'sea' effects and reciprocal effects. 

The general combining ability (gca) is the over all performance of 
lines involved in a set of crosses and any deviation from the average 
performance is termed as specific combining ability (sea). Here using a 
suitable statistical model the component variance due to general and 
specific combining ability are estimated which in turn are translated in to 
genetical component like and o£ under certain assumption. Method 
of anal y s ,s for combining ability considering Eisenhart model 1 (fixed 
errect) and model II (random effect) has also been given by Griffing. 
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In each method two steps are involved in the analysis of data. The 
first step consists of analysis of data for testing the null hypothesis that 
there are no genotypic differences among the F-,’s parents and the 
reciprocals. When only significant differences are there, then only there 
is a need for the combining ability analysis. 

Method I : (F 1 ’s + parents + reciprocals) (fixed effect model) 

Here n parents, (n) (n-1) / 2 F^s and n(n-1)/2 reciprocal F^s are laid 
out usually in randomized block design to find out the difference among 
genotypes. The model for RBD will be 

yijk = p + c ij + b k + eij k ...(1) 

k = 1,2.r 

ij = 1,2.n 

Where n = general mean 

c ij = (i,j) th genotypic mean 
b k is the X th block effect and 

e^ is the random error assumed to be normally and independently 
distributed with mean 0 and constant variance a 2 . 

r 

yy = =(>xj) cross totals over all replications 


y..k - ZYijk - k th block total containing all genotypes 

i,j=1 

y... = X y«k .... 

y,k -grand total, 


Analysis for RBD is done to see if the genotypic differences are found 
to be significant then combining ability analysis is done. The Cj, In model 
(1) is further splitted as under: 

C jj = 9j + 9j + Sy + Ty 

Where g ( is the gca effect of i th line 
gj is the gca effect of j th line 
sij is the sea effect of i th and j th line 
and ry is the reciprocal effect of i th and j th lines. Now the full model 
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becomes; 

yfiK = ^ + 9i + gj + s ij + r ij +b k +e ijk 

i,j = 1,2.n 

k= 1, 2, ...r 

Estimation of combining ability effects and the computation of various 
sum of square are illustrated with the help of an example: 

Example : 

Let us consider that 6 parents were involved in full diallel producing 
36 crosses laid out in RBD using four replications. The data on grain 
yield (gms) / plant is given 

Crosses 

Ri 

r 2 

*3 

*4 

1 x 1 

104.86 

84.32 

76.92 

76.48 

1 x 2 

88.66 

105.04 

80.80 

73.54 

1 x 3 

109.76 

78.22 

74.52 

99.52 

1 x 4 

128.10. 

123.84 

92.56 

115.28 

1x5 

128.36 

119.84 

103.24 

129.72 

1 x 6 

74.40 

70.86 

60.94 

68.00 

2x1 

88.70 

69.10 

76.80 

88.16 

2x2 

88.02 

106.52 

89.52 

108.68 

2x3 

110.16 

116.26 

99.76 

120.12 

2x4 

101.26 

80.22 

82.84 

88.36 

2x5 

91.52 

113.96 

87.26 

106.98 

2x6 

59.06 

65.52 

81.62 

86.76 

3x1 

75.28 

124.74 

94.56 

114.34 

3x2 

112.48 

92.76 

90.62 

122.36 

3x3 

77.94 

71.34 

77.52 

69.48 

3x4 

114.44 

119.96 

84.76 

86.42 

3x5 

96.88 

100.86 

86.88 

92.52 

3x6 

109.86 

98.16 

93.26 

102.26 

4 x 1 

124.26 

132.48 

114.38 

105.34 
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4x2 

92.18 

82.16 

81.66 

101.24 

4x3 

98.08 

90.94 

96.2 

125.48 

4x4 

80.82 

106.54 

83.28 

95.92 

4x5 

86.20 

76.36 

79.06 

99.52 

4x6 

103.14 

109.66 

90.98 

119.4 

5x1 

109.74 

99.56 

110.18 

125.68 

5x2 

109.94 

117.52 

95.56 

88.54 

5x3 

89.56 

94.56 

83.66 

85.28 

5x4 

80.96 

71.98 

91.34 

89.28 

5x5 

59.96 

52.48 

52.98 

50.98 

5x6 

98.46 

73.10 

89.18 

75.86 

6x1 

72.92 

76.28 

61.66 

64.48 

6x2 

58.56 

86.72 

65.26 

74.64 

6 x ? 

104.18 

100 24 

85.12 

108.76 

6x4 

109.44 / 

97.74 

121.1 

106.38 

6x5 

81.58 

95.52 

84.48 

90.28 

6x6 

96.44 

98.82 

99.14 

107.16 


Step I: Do the analysis as in case of RBD and prepare the following 
ANOVA Table. 


Table-1 : ANOVA tabel method-1 


Source of 
variation 

d.f. 

sum of square 

Mean Sqaure 

F calcu¬ 
lated 

Replication 

3 

2029.718 



Treatment 

35 

32,776.443 

936.470 

7.942* 

Error 

105 

12381.699 

117.921 


Total 

143 

47,187.860 




Here the genotypic sum of squares comes out to be significant so 
we proceed further for the combining ability analysis 
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Step 2: Combining ability analysis: 

For combining ability analysis the mean of all the crosses over 
replications are found and are presented in table 2. The total variability in 
the population may therefore, be partitioned into components like variance 
due to general combining ability, specific combining ability reciprocal 
and error. Using replication mean as given in table 2 (one observation per 
genotype the various sum of squares are obtained as follows: 

Table-2 : Mean data of full diallel (Method I) 


Parents 

1 

2 

3 

4 

5 

6 

Total 

Mean 

1 

85.645 

87.010 

90.505 

114.945 

120.290 

68.550 

566.945 

2 

' 80.690 

98.260 

111.575 

88.170 

99.930 

73.240 

551.865 

3 

102.230 

104.555 

74.070 

101.395 

94.285 

110.885 

577.420 

4 

119.115 

89.310 

102.675 

91.640 

85.285 

105.795 

593.820 

5 

111.290 

102.890 

88.265 

83.390 

54.100 

84.150 

824.085 

6 

68.835 

71.295 

99.575 

108.665 

87.965 

100.390 

536.725 

Total 

567.805 

553.320 

566.665 

588.205 

541.855 

533.01 

3350.86 

Total 
<*. + y-j ) 

1134.75 

1105.185 1144.085 1182.025 

1065.940 1069.755 6701.72 


Computation of g.c.a. effects 


9i = ^( yi+y .i)-^ yVi = 1.2 1 .6 

9i = 7^.(566.945 + 567.805 )-~ x 3350.86 
= 94.5625 - 93.0794 = 1.4831 


92-* -0.981 
g 3 = 2.261 
0,= 5.423 
9s =-4.251 


similarly 
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g 6 =-3.935 


As a check E g| = 0 


gca values for the parents are given as the diagonal values in table-3. 

Step 3: Specific combining ability (SCA) Effects 

s u = 2 ( Yi i +y i0 _ ^(yi+y-i+yj+y-j)+-j-y... 

1 . 1 A 

s i2 = 2(yi2 + y2i)-^^(yi.+y.i+y 2 +y.2)+T5-y... 

1 1 1 

= —(87.010 + 80.690) -—(1134.75 +1105.185) + ^(3350.86) 

= 83.850-186.661 + 93.079 = -9.732 

Similarly other sea effect can be calculated and are presented in the 
upper half of the table-3. 

Step 4 : Reciprocal Effects 

The reciprocal effects can be obtained 



When y ( j - y^ there will be no reciprocal effect 


1 1 

r 2i = 2 < Y 2l - y i 2 )= 2 (80.690-87.01.) =-3.160 

All other reciprocal effects have been similarly calculated and are 
presented in the lower half of table-3. 

Table-3 : gca (diagonal) and sea (above diagonal) and 
reciprocal (below diagonal) effects (method I) 



P(1) 

P(2) 

P(3) 

P(4) 

P(5) 

P(6) 


1.483 

-3.160 

5.862 

2.085 

-4.500 

0.142 


-9.732 

-.981 

-3.510 

0.570 

1.480 

-0.973 


-0.456 

13.705 

2.261 

0.640 

-3.010 

-0.655 


17.045 

-8.781 

1.272 

5.423 

-0.947 

1.435 


25.479 

13.562 

0.186 

-9.913 

-4.251 

1.908 


-21.935 

-15.896 

8.824 

12.663 

1.164 

-3.935 
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Step : 5 


S.S. due to GCA = ^(yj.+y.j) 



1 


2 x 6 L 


(566.945 + 567.805) 2 + (551.865 + 553.320) : 


+.+(536.725+ 533.01) 2 


2 

6x6 


(3350.86) 2 


= 855.071 

Step 6: 

1 1 2 1 
SS due to SCA = -ssy^yij + y ji )-—£(yi +y.j) 


= j [85.645 (85.645 + 85.645) 

+ 87.010(87.010 + 80.690) +. 

+ 84.150 (84.150 + 87.965) +. 

+ 100.390(100.390 + 100.390) 

- 1/2x6 [(566.945 + 567.805) 2 +. 

+ (536.725 + 533.01) 2 ] + 1/6x6 (3350.86) 2 
= 7136.6395 
Step 7: 

1 2 

SS due to reciprocals = -££(yy — yji) i * j 


= -[(87.010 - 80.690) 2 + (90.505 -102.230) : 


2 L 


+.+(84.150 - 87.965) 2 


= 202.495 

As a check, treatment SS = r (SSgca + SSsca + SSrec) 
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SS due to error need not to be calculated. The mean sum of square 
due to error as obtained in table-2 may be used for this purpose This 
value however, be divided by the number of replications (r=4) before using 

p h rrnr vari a f0r ^mo 6 nU " hypothesis ' MS(error) / r is in fact the mean 
error variance as MSg and MSs have been calculated from the mean 
data. 

The mean error variance required for the F test: 


MS'(error) = M!i erf0r ) = 117.921 = 29 

4 


n 


29.480 


Table 4. Anova for Combing Ability (Method I) 


Source 

d.f. 

SS 

MS 

F 

gca 

5 

855.07 

171.014(Mg) 

5.801** 

sea 

15 

7136.6395 

475.775(Ms) 

16.139** 

reciprocal 

15 

202.495 

13.499(Mr) 

.4586 

error 

105 


29.48(Me’) 



Variance, standard errors and critical differences 

Var(g,) = ^^ (29 . 48) = 2 , 0 ^ 


(n-1) _ 2 25 


Var(s H ) = = -‘^-(29.48) = 20.472 

36 ’ 


Va r( s jj) = ~{r\ 2 - 2n + 2)cr 2 = -12 + 2) x 29.48 


2n 
= 10.466 


1 2 29.48 


Var(r li ) = -o 2 = 

11 2 2 

Var(g i -g J ) = -o 2 = 


= 14.74 


1 2 29.48 


n 


- = 4.913 


V ar (S|j ~ Sjj) = ——ct 2 = x 29.48 = 39 307 

n 6 


6 
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Var(Sjj -s ik ) = 


n-1 2 

-a 

n 


= -x29.48 = 24.567 
6 


Var(Sjj — s w ) = ———g 2 = ^ x 29.48 = 19.653 
J n 6 

Var(ijj - r w ) = a 2 = 29.48 

These values are used for calculating the critical difference for making 
comparison between different effects. Now critical difference can be 
calculated as: 

* \<x/2' error d.f) 

It we want to compare gca effect of parent 3 and 5 the CD is: 

CD = ^Var(a-gj) x t (osi05 ) = V4.913 x 196 = 2047 x 1.96 = 4.013 

y 

Which can be used to compare the mean difference of gca effect of 
parent 3 and 5. The gca of parent 3 is 2.26 and parent 5 is - 4.251. The 
difference in 6.511 which is greater than C.D. value. Hence two gca effects 
differ significantly from each other. Similarly other effects can be compared. 

Model 2 (Random Effect Model) 

In this case the model will remain the same as in the fixed effect 
model except that all the components will be assumed random except 
p, i e. gj, gj, Sy,-r» will be assumed to be normally distributed with mean 
0 and variance a|, a 2 and cr 2 respectively, e jjk>s are assumed to be normally 
and independently distributed with mean 0 and constant variance g 2 . 
The method of computing the various sum of squares will be the same 
as in the fixed effect model, but the EMS values will be different and 
instead of estimating the effects corresponding variances are estimated. 
Here the hypothesis to be tested will be : 

a g = 0 > g 2 = 0, g 2 = 0 

Genetic component 


CT 


2 

g 



M; + n(n -1) M s 


n 2 - n + 1 



1 

2x6 


171.014 - 


29.48+6x5x475.775 " 
36-6 + 1 


= 12 f 171014 “ 461 ' 37 °] = - 24 - 197 


n 2 ,36 

•4 'i(^W^ (M *- M;) ’23T< 475775 - 29 - 48 ) 

= 259,139 
1 1 

<*r =2( M r _M e) = 2( 13 - 49_29 - 48 ) = _7 " 5 
CT 2 =29.48 

a g^ a s'°f ar >d a 2 stands for estimate 

Here o 2 =-a^ 
and a 2 = <tq 

cr a = 2 x Og = 2 x -24.197 = -48,394 


al =259.139 

Method 2 (Parents and one set of F^s) fixed effect model 

Here the model: 

Vijk = I 1 + 9 j + 9 j + Sy + b k + e jjk k = 1,2.r 

r<j = 1,2, 3.n 

Considering n = 6 there will be 21 crosses and 6 parents. The data 
from example I has been taken for illustration and is given in Table-5. 
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Table 5 : Data an grain yield (gm) plot 


Ri 

r 2 

r 3 

*4 

1 X 1 

104.86 

84.32 

76.92 

76.48 

1 x 2 

88.66 

105.04 

80.80 

73.54 

1 x 3 

109.76 

78.22 

74.52 

99.52 

1x4 

128.10. 

123.84 

92.56 

115.28 

1x5 

128.36 

119.84 

103.24 

129.72 

1 x 6 

74.40 

70.86 

60.94 

68.00 

2x2 

88.02 

106.52 

89.52 

108.68 

2x3 

110.16 

116.26 

99.76 

120.12 

2x4 

101.26 

80.22 

82.84 

88.36 

2x5 

91.52 

113.96 

87.26 

106.98 

2x6 

59.06 

65.52 

81.62 

86.76 

3x3 

77.94 

71.34 

77.52 

69.48 

3x4 

114.44 

119.96 

84.76 

86.42 

3x5 

96.88 

100.86 

86.88 

92.52 

3x6 

109.86 

98.16 

93.26 

102.26 

4x4 

80.32 

106.54 

83.28 

95.92 

4x5 

86.20 

76.36 

79.06 

9S.52 

4x6 

103.14 

109.66 

90.98 

119.40 

5x5 

59.96 

52.48 

52.98 

50.98 

5x6 

98.46 

73.10 

89.18 

75.86 

6x6 

96.44 

98.82 

99.14 

107.16 

Example: 





Step 1 : 

The various sum of squares are calculated as in RBD and 

the ANOVA table will be : 





Table 6 : 

ANOVA for Method 2 


Source 

d.f. sum of square 

Mean Square 

F-Calculated 

Replication 

3 1723.821 



Treatment 

20 20375.372 

1018.769 

9.565* 

Error 

1 

60 6390.778 

106.513 



83 28489.971 _ 

Her ® ‘ he genotypic sum of square comes out to be significant so we 
proceed further for the combining ability analysis 

2. Combining ability analysis 

aenotvnp^r 9 ' 760 ' n ' S rearran 9 e d by taking their mean. Thus each 

9 yp s represented by one observation only as is given in table-7: 


k 



Table 7 : Replication Mean in Method 2 
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Step-21 Estimating of GCA effects 

The general combining ability effects are defined as follows. 



1 


(n + 2)|_ 


(yi-+yi)- 



9i 


jl 

8 


(yl +y«)- 



i 

91 = 8 


652.590 - - x 1930.115 
6 


= +1.152 


Similarly 


1 

92 = 6 


(y 2 .+y22)-gy- 


i 

92 = 6 


656.445--x 1930.115 
6 


= +1.634 


g 3 = 0.427, g 4 = 4.437, g 5 = -6.404, g 6 = -1.246 
gca effects are given in the diagonal of table-8 

Step-3 : Specific combining ability (sea) effects 


S12 = y« - j(yi+yn+y- 2 +y2 2 )+y^-y- 


= 87.010 - -(652590 + 656.445) + — x 1930.115 
8 56 

= 87.010-163.629 + 62.932 = -7.687 

Similarly sea effects for all other crosses are estimated and are given 
in upper half of the table-8. 
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Step 4: Estimation of sum of square 

SS due to gca 




= - (652.590) + (656.445) + (646.785) z + (678.87)' 


(59214) 2 +(633.40) 2 --x(1930.115) 2 

6 


= -^[2487814.399 - 2483562.609] 


4251.7902 


= 531.473 


Table-8 : gca (diagonal) and sea (above diagonal) 
effects for Method-2 



Pi 

1.152 

-7.687 

-2.984 

17.445 

33.631 

-23.266 

P 2 

- 

1.634 

17.604 

-9.812 

12.790 

-19.058 

P 3 

- 

- 

0.427 

4.621 

8.352 

9.795 

P 4 

- 

- 

- 

+ 4.437 

-4.659 

10.694 

P 5 


- 

- 

- 

-6.404 

-0.110 

P 6 

- 

- 

- 

- 

— 

-1.246 


Step 5 : SS due to sea 

= (^2) I(yi+y » )2 V+1Kn + 2) y2 - 

= (85.645) 2 +(87.010) 2 ....+(54.10) 2 +(84.15) 2 +(100.39) : 
-|[(652.590) 2 +.+(633.40) 2 + (1930.115) 2 




= 182491.1032 - —(2487814.399) +118264.886 

8 

= 182491.1032 - 310976.7999 +133047.99 = 4562.2933 

Check : Treat SS = r (SS due to gca + SS due to sea). 

SS due to error is obtained from the table by dividing the error SS by 
4. Thus SS due to error is 1597.6945. 

The analysis of variance for combining ability analysis in Method 2 is 
given in Table 9. 

Table-9 : ANOVAfor combining ability analysis (Method 2) 


Source 

d.f 

SS 

MS 

FFcal 

gca 

5 

531.473 

106.2946 (Mg) 

3.99 

sea 

15 

4562.2933 

304.152 (Ms) 

11.42* 

Error 

60 

1597.6945 

26.6282(M e ') 



Variance, standard errors and critical difference 


Varfa) = (n ^ -a 2 = —— x 26.6282 = 2.774 
w n(n+2) 6x8 

Varfsjj) = (h 2 + n + 2 ) g 2 = ( 36 + 6 + 2 ) x 26 6282 = 20 922 
' (n + 1)(n + 2) 7x8 

Var(gi - gj = ± x 26.6282 = 6.657 


Var(s,j) = 


(n + 2) 8 

n(n -1) 2 6 x 5 


a = 


(n + 1)(n + 2) 7x8 


x 26.6282 = 14.265 


Var( s ij - s jj) = 


2(n-2) 2 2x4 


(n + 2) 


a = 


8 


x 26.6282 = 26.6282 


Var (Sij - s ik ) = - 2(n + ^ = ^7 x 26.6282 = 46.5993 

(n + 2) 8 


2n 


2x6 


Var ( s ij - s ki) = = ++ x 26.6282 = 39.9423 


(n + 2) 


8 
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The various standard errors can be calculated by taking the square 
root of the various variance and critical difference can be found in the 
usual way. 

Method 2 (Random Effect Model) 

Estimation of component variance: 

c\ = ——(M.-M.) 

9 (n + 2)' 9 s ' 


= -1(106.2946 - 304.152) =. 24.732 
8 

a|= M s -M e ' = 304.152 - 26.6282 = 277.5238 

These component can be translated into genetic component using 
the following equation 

c 2 g =^c%, cl=2c 2 g =2x -24.732 =-49.464 

cl =cl cl= 277.5238 

Method 3 (one set of F^s and reciprocals) 

Fixed Effect Model: 

In this method one set of F^s and reciprocal crosses are used. The 
total entries are n (n-1) F^s and n parents. The data given in table-1 after 
ignoring the values of diagonal i.e. the parent value has been used for 
illustrating the method of analysis which is given in Table-10. 

Table-10 : grain yield (gms) / Plant 



R1 

R2 

R3 

R4 

1 x 2 

88.66 

105.04 

80.80 

73.54 

1 x 3 

109.76 

78.22 

74.52 

99.52 

1 x 4 

128.10 

123.84 

92.56 

115.28 

1x5 

128.36 

119.84 

103.24 

129.72 

1 x 6 

74.40 

70.86 

60.94 

68.00 

2 x1 

88.70 

69.10 

76.80 

88.16 
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2x3 

2x4 

2x5 

2 x6 

3x1 

3x2 

3x4 

3x5 

3x6 

4x1 

4x2 

4x3 

4x5 

4x6 

5x1 

5x2 

5x3 

5x4 

5x6 

6x1 

6 x2 

6x3 

6x4 

6x5 


110.16 

101.26 

91.52 

59.06 

75.28 

112.48 

114.44 

96.88 

109.86 

124.26 

92.18 
98.08 
86.20 
103.14 

109.74 
109.94 

89.56 

80.96 
98.46 
72.92 

58.56 

104.18 
1.09.44 
81.58 


116.26 

80.22 

113.96 

65.52 

124.74 

92.76 

119.96 

100.86 

98.16 

132.48 

82.16 
90.94 

76.36 

109.66 

99.56 

117.52 

94.56 

71.98 
73.10 

76.28 
86.72 
120.24 

97.74 

95.52 


99.76 
82.84 

87.26 
81.62 

94.56 
90.62 

84.76 
86.88 

93.26 
114.38 

81.66 
96.20 
79.06 

90.98 

110.18 

95.56 

83.66 

91.34 

89.18 

61.66 

65.26 
85.12 
121.1 

84.48 


120.12 

88.36 

106.98 

86.76 

114.34 

122.36 
86.42 

92.52 

102.26 

105.34 
101.24 

125.48 

99.52 
119.4 
125.68 
88.54 

85.28 

89.28 

75.86 

64.48 
74.64 

108.76 
106.38 

90.28 


1. Testing the significance of genotypic difference : 

The various sum of squares are calculated as in RBD. The analysis 
of variance (Method 3) is given in Table-11. 
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Table-11 : Analysis of Variance (Method 3) 


Source 

d.f. 

SS 

M.S. 

F 

Replication 

3 

1980.881 



Treatment 

29 

24308.736 

838.232 

6.60 

Error 


87 

10950.440 

125.867 


Total 


119 

37,240.056 



2. Combining Ability Analysis : 



As done earlier the combining ability analysis is done over mean 
value ignoring the diagonal values as given in Table-12. 




Table-12 : Mean Data 



P(1) 

P(2) 

P(3) P(4) 

P(5) P(6) 

Total Yi 

P(1) 

- 

87.010 

90.505 114.945 

120.290 68.550 

481.300 

P(2) 

80.690 

- 

111.575 88.170 

99.93 73.240 

453.605 

P(3) 

102.230 

104.555 

101.395 

94.285 100.885 

503.350 

P(4) 

119.115 

89.310 

102.675 - 

85.285 105.795 

502.180 

P(5) 

111.290 

102.890 

88.265 83.390 

84.150 

469.985 

P(6) 

68.835 

71.295 

99.575 108.665 

87.965 - 

436.335 

Total 

yj 

482.16 

455.06 

492.595 496.565 

487.755 432.62 


(yi.+y.j) 963.46 

908.665 

995.945 998.745 

957.74 868.955 

2846.755 


Estimation of gca effects 


9i = 2^2)t n(yi+yi) " 2y -] 


---[6 x 963.46 - 2 x 2846.755] 

2 x 6 x 4 L 


g, = -^-[5780.76 - 5693.51] = 1.817 
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Similarly 

g 2 = -5.032 
g 3 = 5.878 

g 4 = 6.228 
g 5 = 1.103 
g 6 = -9.995 

and are shown in the diagonal values of table-13. 

Estimation of sea effects 



2(n - 2) 


(yi+y-i+yj+y-j) 


+ 


1 


(n-1)(n-2) 


y- 


11 1 

si 2 = j(yi2 + v2i)-^j(yi + y-i + y2 + y-2)+^;y- 


s 12 = -(87.010 + 80.690) - -(963.46 + 908.665) 

2 8 

+ —X 2846.755 
20 

= 83.85-234.0156 + 142.3377 = -7.828 

Similarly other sea effects for the crosses are calculated and are 
given in the upper half of Table-13. 

'u = 2^'i _ ^i') 


r 2i = ^-(80.690 -87.010)= -3.16 

The other reciprocal effects are given in the lower half of the table-13. 

Step 4: S.S. due to gca : 


1 


2(n - 2) 


£(yi+y.i) 2 - 


n(n - 2) 


-r 
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= ^~^[( 963 ' 46 ) 2 + (908.665) 2 +.+(868.955)' 


6x4 


(2846.755)' 


= 676959.2464 - 6575334.5025 = 1624.7739 
Step 5 : S.S. due to sea : 


2 ss (yij + yji) _ 2) s ( y i- +y -i) 


2 1 2 

+-y . 

(n - 1)(n -2y 


(87.010 + 80.696) 2 + (90.505 +102.330) : 


+ . + 


(84.150 + 87.965) 2 - (963.46) 2 + (908.665) 1 


+ . + 


(868.955) : 


+ 


5x4 


(2846.755) : 


1 

: - x 552016.9338 - 676959.2464 + 405200.7015 

276008.4669 - 676959.2464 + 405200.7015 

4249.922 


1 i 

SS due to reciprocal = — £S( yi , - yjj )‘ 


2 L 


(87.010 - 80.690) 2 + (90.505 -102.230) : 


+. 


(84.150 - 87.965) 2 = 202.4881 

Check: 

Treat SS = r (SS gca + SSsca + SS reciprocal) 

QC , , ... ,, S.S due to error (table 11) 

SS due to error (M ’) =---- 


4 
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10950.44 

4 


s 2737.61 


Table 13 : gca (diagonal) sea (above diagonal) and reciprocal 

below diagonal effects 


P(1) 

P(2) 

P(3) 

P(4) 

P(5) 

P(6) 


-pm P(2) P(3) PW P(5) P(6) 

7^8 7s28 -<3220 14.092 17.978 -18.022 

-3.160 -5.032 12.326 -7.349 10.447 -7.597 

5.862 -3.510 5.878 -4.964 -10.598 9.455 

2.085 0.570 0.640 6.228 -17.885 16.105 

-4.500 1.480 -3.010 -0.947 1.103 0.058 

0.142 -0.973 -0.655 1.435 1.908 -9.995 


The analysis of variance is given in table-14. 

Table 14 : ANOVA for combining ability analysis (Method 3) 


Source 


1624.7739 

4249.922 


324.9548 (M g ) 


10.326* 

15.007* 


3 9 4249.922 472.2135 (M s ) 15.00 

Reciprocal 15 202.4881 13.4992 (M r ) 

Error 87 - 31.4667 (M e ') 


Estimation of variance, standard error and the critical differences 

Varfa) = a 2 = ---x 31.4667 = 3.2778 

1 2n(n - 2) 2x6x4 


Var(s„) = 


(n - 3)cr 3 
2(n -1) 2x5 


X 31.4667 = 9.4400 


.. . , cr 31.4667 
Var ( | ij) = y = —j — =157333 


Var(g, -g,) = _^ 


31.4667 


(n-2) 


= 7.8666 
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Var(s B -s ik ) = ^ ^ o 2 =-x31.4667 = 23.6000 
J n-2 4 

Var(s,j-s kl ) = ^ 1 ^CT 2 =|x 31.4667 = 15.7333 

Now various standard errors and C.D. values can be calculated. 
Method 3 : Random effect Model: 

Estimation of component of variance and their genetic interpretations 

-M s )~(324.9548 - 472.2135) 

= -18.4073 

a 2 = 1(m s -M e ) = 1(472.2135 - 31.4667) = 220.3734 

a? = ±(M S - M e ) = 1(13.4992 - 31.4667) = -8.9837 
a 2 = M e = 31.4667 

where 

a g = ^°A and <r 2 =a£ 

<yj =2x-18.4073 = -36.8146 

ctq = 220.3734 = 220.3734 

Method 4 (One set of F^s only) 

Fixed effect Model 

The experimental material consists of F.,s only, the total entry being 
(n(n-1)/ 2 ( i.e . 15 in case the number of parents is 6. The data given in 
table 5 (Method 2) has been used for the analysis after ignoring the 
parents 

The Analysis of variance is given in table-15. 
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Table 15 : Analysis of Variance (Method 4) 

Source of v ariation d.f. Sumofsqaure Mean Square FeaP 

Replication 3 1785.694 

Treatment 14 12179.294 869.950 7.535* 

Error 42 4848.807 115.448 

T otal 59 18813.796 ~ 

Combining Ability Analysis 

Means of data over replications are calculated as in the previous 
methods for preparing the diallel table required for combining ability 
analysis : 

Table 16 : Mean data for Full diallel (Method 4) 


P(1) P(2) P(3) P(4) P(5) P(6) Total 

P(1) - 87.010 90.505 114.945 120.290 68.550 481.300 

P(2) 87.010 - 111.575 88.170 99.930 73.240 459.925 

P(3) 90.505 111.575 - 101.395 94.285 100.885 498.645 

P(4) 114.945 88.170 101.395 - 85.285 105.795 495.59 

P(5) 120.290 99.930 94.285 85.285 - 84.150 483.94 

P(6) 68.550 73.240 100.885 105.795 84.150 - 1426.01 

Estimation of the gca effects 


9i = 


n( n - 2) 


[n Yi-—2y..] 


g, = ———[6 x 481.300 - 2 x 1426.01] 

g, = -^-[2887.8 - 2852.02] = 1.491 

Similarly 

g 2 = -3 853 
g 3 = 5.827 
g 4 = 5.062 
g 5 = 2.151 
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g 6 = -10.679 

These values are given as diagonal elements in Table 17. 

Estimation of sea effects 


Sii=yii_ (^2)( yi+y i) + (n-lXn-2) y - 

si 2 = y 12 - ^(yi+y- 2 )+t^7( 142601 ) 

= 87.010 - -1(481.30 + 459.925) + — x 1426.01 
4 ' 20 

= 87.010-235.3062 + 142.601 =-5.695 

Similarly the other sea effects can be calculated for the remaining 
error and are presented in Table-17. 

Combining Ability Analysis 

Estimation of sum of squares: 


SS due to gca 



(481.30) 2 +(459.925) 2 .+(432.62) : 


= ^-[1358794.968] - -j-(1426.01) 2 
= 339698.7419 - 338917.42 = 781.3218 


6x4 


(1426.01)' 


Table 17 : ges (diagonal), sea (above diagonal) effects 


P(1) 

P(2) 

P(3) 

P(4) 

P<5) 

P<6) 

Total 

P(1) 

1.491 

-5.965 

-11.880 

13.324 

21.581 

-17.329 

P(2) 


-3.857 

14.533 

-8.108 

6.565 

-7.295 

P(3) 



5.827 

-4.563 

-8.700 

10.670 

P(4) 




5.062 

-16.997 

16.343 

P(5) 





2.151 

-2.389 

P(6) 






-10.679 



SS due to sea 


^f-^2 Sy ' ? + (n-1)(nl2) 


y 2 - 


= (87.01) 2 + (90.505) 2 .+(84.105) 2 - j (481.30) 


+(459.925) 2 + 


+(432.62) 


2 

5x4 


(1426.01) 2 


= 138611.793 - 339698.7419 + 203350.452 

= 2263.5031 


Check: 

Treat SS = r (SS due to gca + SS due sea) 

Table 18 : ANOVA for combining ability Analysis (Method 4) 


Source 

d.f. 

Sum of sqaure 

Mean Square 

Feal 

ges 

5 

781.3218 

156.2644 (Mg) 

5.414 

sea 

9 

2263.5031 

251.500 (Ms) 

8.714 

error 

42 


M’e = 28.862 



_ _ . . „. 1V Error mean square 

SS due to error (M e ') =--—--(Table 15) 


115.448 

4 


28.862 


Estimation of standard Error 


SE( 9i ) 


n-1 
n(n - 2) 



——x 28.862 = 6.0129 
6x4 


SE ( s y) = ~7J a2 = | x 28.862 = 17.3172 
SE (9i ~ 9j) = a2 = “ x 28.862 = 14.431 
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SE(Sjj - s ik ) = 2 ^_ ^ g 2 = x 28.862 = 43.293 

CQ , 2(n-4)a 2 2x2 

SE ( s ij - s w ) = -7 - 7 T— = —— X 28.862 = 28.862 

(n - 2 ) 4 

The various standard error and CD values can be calculated in the 
usual way. 

Method 4 (Random effect Model) 

Estimation of Component of Variance and their genetic interpretation 

1 1 

CT g = (ir^)( Mg " M s)= 4 (156.2644 -251.500) 

= -23.809 

a| = M s = Me = 251.500 - 28.862 = 222.638 
Further 

t 

2 1 2 

CT g = 2 ct a vl = 2 X -23.809 = -47.618 
=<*d <7q = 222.638 


□□□ 



3D 

DESIGNS AND ANALYSIS 
FOR BIOLOGICAL ASSAYS 


A biological assay is an experiment conducted on the living organism 
with a view to compare the relative efficiency of two (or more) materials 
like hormones, drugs, insecticides, etc. on the basis of the effects that 
they produce on the living organisms. Here the comparison is made on 
the basis of two sets of doses, one from the standard preparations and 
the other from test preparations. The standard preparation is likely to be 
sample of either the international standard or a more readily available 
working standard whose potency relative to the international standard 
has previously been worked out carefully and the test preparation is 
unknown potency whose relative potency is to be evaluated. If Zs and Zt 
denote the doses of the standard and test preparations respectively, 
which produce the same response, then comparison of two test 
preparations is obtained by calculating the coefficient called the relative 
potency, which is given by the ratio p = Zs / Zt. Thus the central purpose 
of a bio essay is to estimate the strength of a test preparation in relation 
to the standard preparation in terms of the relative potency alongwith a 
measure of its precision by conducting an appropriate experiment for 
estimating the doses of S and T. 

Every bio assay has three components namely : 

i) Stimuli (materials) which are to the compared 

ii) Living organisms (subjects) to which different stimuli are given. 

iii) A measure of characteristic response produced by the stimuli 
on the organism. 

For practical examples refer to Finney (1978, chapter 4 and 5) 
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A bio assay can be either direct or indirect. For direct assays the 
doses of standard and test preparations are just sufficient to produce the 
desired effect, for example death of an animal. Direct assays are used 
only when both the preparations can be administered in such a way that 
the minimal amounts needed to produce the specified response can be 
measured exactly. In indirect assays predetermined doses are 
administered to the subjects and their responses are recorded. If the two 
preparation involved in an assay contain the same effective ingrediant 
and all the other substance, are inert, then the assay is called an 
analytical dilution assay. Here we will discuss only indirect diluting 
biological assays. 

Metametric Transformation 

The available methods for estimation of relative potency are based on 
the assumption of linear relationship between doses and the response. If 
the relationship is not linear, either dose or response or both are transformed 
to make the relationship linear. These transformations are commonly 
known as linearising or metametric transformation. The transformed dose 
and response are called dose metameter and response metameter. The 
usual transformation are of the type (i) log Z = X, y and (ii) z x = x,y where 
x is the dose metameter, y is the response, Z is the dose in the original 
scale and is a constrant. As a consequence of these transformations, 
the indirect bio assays have been divided in to categories (a) parallel line 
assays (b) slope ratio assays. 

Case 1: For x = log (dose) as linerising transformation, let Y = a s + 
bx s denote the relation between the response, y and x, where x s = log 
z s and z s denote dose of standard preparation. If z t denote dose 
equipotent of the test preparation then p = z s / z t i.e. log p = log z s - log 
z t = x s - x t as x s = log p + x t . We have 

Y = a s + b (log p + x t ) that is 

Y = a t + b^ where a t = a s + b log p 

The relationship of test preparation is also linear like that of a standard 
preparation. These lines for standard and test preparation have the same 
slope and hence are parallel. Here log p = (a t - a s ) / b with a t and a s the 
intercepts and b is the common slope. 

If there are K doses of each of two preprataions and x s and x t denote 
the averages of the dose metameter and y s and y t are the average 
responses for the preparation then it is known that: - 
a s = y s - bx s and a t = y t - bx t 
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on substituiting these values in log p = (a t - a s ) / b we get an estimate 
R of pas log R-x s -x t -(y s -y t )/b 

Two lines are parallel when the dose metameter is log (dose). The 
assays correspor^ng to this transformation are called parallel line assays. 


Here (y s - y t ) is a contrast and it is called the preparation contrast. 

A paralel line assay is said to be symmetrical (SPL) if the number of 
doses in each of the preparation is same otherwise it is called 
asymmetrical (APL). 


Case II : When the linearising transformation is x = z x the two 
equations are y = a + b s x ? and y = a + b t + x t for standard and test 
preparations where b t - b s p* and therefore, p x = b^bg. The corresponding 
assays are called slope ratio assays. These lines interact at the response 
axis. This gives rise to another validity test in addition to the validity test 
for linear relationship. 


Parallel line assays: 


Suppose there are K doses for each preparation, and n subjects be 
allotted to each of these preparations. Let the doses of standard and 

test preparations be respectively denoted by s 1( s 2 .s k and t,, t 2 

.t k . The response of rth subject allotted to p" th (g-™) dose of 

standard (test) preparation be denotely y s p r (y t p r ). The response data 
can be arranged in the form. 


Response data from 2k point Assay 


Response Standard preparation doses Test preparation doses 


S 1* S 2 


. 

t 1- *2- 


. *k 

Vsll 

Vs21 

y sk i 

y«i 

y t2 i 

y t ki 

y s i 2 

y S 22 

ysk2 

y«2 

y t22 

ytk2 

^sln 

y S 2n 

yskn 

ytln 

y t 2n ••• 

y t kn 

Total S 

s 2 ... 

.. s k 

Ti 

t 2 ... 

.. T k 


The analysis and the estimation of the relative potency becomes very 
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much simplified when the doses of each of the preparations are taken in 
geometric progression as given below. 

S, cs, c 2 s.c k_1 s and t, ct, c 2 t.c k_1 t where c is 

a constant while s and t are the initial doses of standard and test 
preparation. 

The doses should be evenly distributed in the range of response in 
which the dose response relationship was investigated for obtaining the 


linearing transformations. 

x si = log c's = log s + i log c (i = 0, 1, 2.k-1) 

x ti = log c't = log t + i log c (i = 0, 1, 2.k-1) 

Here 


— k-1 

x s = log s + -—log c 


_ _ 'J 

x t =log t + —— logc 


and 


- (. k-1^\ 

x S i-x s = i—— 

V 2 J 


log c and 


( 


x ti - x* = 


i- 


k-lA 


logc 


Here Xg and x t denotes the averages of two preparations. Now two 
cases arise when k is odd and when k is even. 

Case 1: When k is odd we choose the base of the logarithm as c so 

that log c is 1. The log dose as deviates from their mean can now be 
written as 


Standard preparations : 

-k-1 -k-3 

2 ’ 2 ’. 

Test preparation : 

-k-1 -k-3 


•- 1 , 0 , 1 , 


k-1 

2 


k-1 


2 


- 1 , 0 , 1 , 


2 











Designs and Analysis for Biological Assays 


278 


Case II: When k is even, the base of log is taken as Vc so that log 
c becomes 2 and hence all the dose deviate [(i-k-1 )/2] log c becomes 
odd integers as shown below : 

Standard preparation : 

-(k-1), -(k-3).-1, 1, 3,.,(k-1) 

Test preparation : 

-(k-1)-(k-3),.......-1, 1,3,.(k-1) 

The regression contrast in each preparation can be obtained by 
multiplying these deviates by the corresponding dose totals and adding 
them. 

Analysis: 

Here first through Analysis of variance technique it is tested (1) the 
response and dose metameter relationship is linear (2) two lines are 
perpendicular to each other. If both these validity tests are found to be 
correct then the relative potency is estimated from : 

log R = x s - x t - ^ s - — and we have 

b 

Zs.-ETi 

y s -y t = 1 . 1 — 

kn 

For the first part of the analysis, preparation contrast among the dose 
totals is obtained as: 

Preparation contrast (Lp) = -£Sj +]TTj 

i i 

when K is odd, the combined regression contrast is given as: 

(Li) = ^(S, + T,) - *|1(:S 2 + T 2 ). + ^(S k + T k ) 

and when K is even 

(Li) = -(k - IKS, + T,) - (k - 3)(S 2 + T 2 ).+(k - 1)(S k + T k ) 

The difference between the two regression contrast of the two 
preparations is called the parallelism contrast. 








279 


Experimental Designing and Data Analysis 


Parallelism contrast (L.,) 


JLJl ( S 1 - T,) - i-i(S 2 - T 2 )....-i!i-l(S k - T k ) 


when k is odd. 
and when k is even 


Li = -(k-IHS, -T 1 )-(k-3) (S 2 -T 2 )... + (k-1) (S k -T k ) 
Here it can be seen that 


y«-yt = 



When k is odd, b - ~ ,, 2 —r and 

kn(k -1) 


When k is even, b =- - - 

2kn(k 2 -1) 


The following analysis variance can be written. 

Analysis of variance in 2k point assays for validity tests 


Source of Variation 

d.f. 

s^. 

m.s f 

Preparation (Lp) 

1 

Lp / 2kn 


Combined Regression (L,) 1 

l]lD 


Parallalism (L,') 

1 

l, 2 /d 

s 2 b s 2 b/s 2 e 

Deviation from regression 2k-1 

By Subtraction 

s 2 d s 2 d/s 2 e 

Doses 

2k-1 

IS? + ST, 2 {I(Si 

+ T,)} 2 



n 

kn 

with in doses (error) 

2k(n-1) 

i By subtraction 

s 2 e 

Total 

2kn-1 

ZysPr+Iy?q r - 

HSi+Ti) 2 



Pr q r 

kn 

Here D = kn (k 2 -1) /6 when k is odd and 
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For testing linearity of regression, the mean square for the deviation 
from regression is tested by F test. For testing parallelism the "parallelism” 
components is tested. 

If both these tests are not significant, then the relative potency can 
be calculated 


logR 


x c - x t - 


^ y s -yt 


= logs-log 


, Lp kn(k 2 -l)j 
+ kn 6L 


when k is odd. 


R 


=-antilog 


d(k 2 -1) LP 
6 L, 


Here d = loglf^ 
When k is even 


logR = log-j- + 


L p 2kn( k2 

kn 3L| 



R = y anti log 



Here we can use CRD or RBD depending upon the availably of 
homogeneous groups of experimental material. These designs are not 
applicable when k is greater than 3 or 4. Usually small animals Like rats, 
cats, guina piga, etc. are used as experimental units is bio assays. 
These can be formed in to homogeneous groups of required size by 
equalizing them in respect of age, breed, housing and other management, 
etc. to form blocks. If the number of doses is small a latin square design 

can also be adopted to increase further the precision of estimate of the 
relative potency. 

It is observed that in a 4 point assay a test of linearity of regression is 
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not available, as the deviation from the regression contrasts does not 
exist in the analysis of variance. In a 6 point assay there are 2 degree of 
freedom for deviation from regression. The following two contrasts of the 
dose totals are the two deviation contrasts. 

Quadratic (L 2 ) combined = (S 1 -2S 2 + S 3 ) + (T 1 - 2T 2 + T 3 ) 

Difference between quadratic = (S 1 -2S 2 + S 3 ) + (T 1 - 2T 2 + T 3 ) 

In general the contrast L n is represented by the n-th degree polynomial. 
The difference between two such polynomials is denoted as L n '. 

Incomplete Block Designs for Bio-Assays 

When the number of doses is large it may not always be possible to 
get suitable homogeneous groups of experimental units for adopting 
randomized block designs. If litters are used as blocks a sufficient numbers 
of litters of required size may not be available even if the number of 
doses is not very large. This forces one to use incomplete block designs. 

The use of incomplete blocks design in bio assays is limited due to 
inflexibility of existing incomplete block designs including balanced 
incomplete block-(BIB) designs. These designs aim at estimating all 
paired treatment differences, while in bio assays all contents are not of 
equal importance. In symmetrical parralled line assays and slope ratio 
assay only two major type of contrasts are of interest. The contrasts are 


i) The difference between the totals of standard and test preparations 
(the preparation contrasts L p ) 

ii) The pooled estimate of slope (combined regression contrast L 1 ) 


By designing properly it is possible to estimate the preparation and 
regression contrasts free from the block effects. In such designs the two 
contrast L p and L 1 can be estimated by the unadjusted dose totals a£ in 

randomized block designs. Let s 1( s 2 .s^ denote K doses (on 

the log scale) of standard preparation arranged in ascending order and 

.^test preparation are arranged is descending order 

of magnitude, the constant difference for successive log doses is the 
same for both the parameters. Let Sj and T-. denote the response totals 

corresponding to the doses S-, and T j (i,j= J i, 2 .k) The important 

contrast are (1) preparation contrast: H 


l p -Z(Tj -Sj) 
i=1 

The next contrast is combined regression contrast L 1 which is the 
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sum of the linear contrasts of dose totals of the two preparations. The 
two contrasts L p and L 1 are essential for estimating the relative potency. 
The other contrasts provide validity tests and are known as parallelism 
contrast and are denoted by L 1 ’ (difference between two linear contrasts) 
l_ 2 and L 2 ’ (sum and difference of the quadratic contrasts of the two 
doses totals) and pair of the contrasts of the type Lm and Lm’ (sums and 
difference of the m th power contrasts among the dose totals of the two 
preparations). 

When an incomplete block design is used, the blocks are no longer 
orthogonal to the doses and at least some of the defined contrasts is 
not free from the block effects. Dose effects must therefore, be estimated 
by the method appropriate to the design and used in place of the 
unadjusted dose totals to calculate the contrast. 

Some series of incomplete block designs for symmetrical parallel 
line assay from which the preparation contrast L p and combined regression 
contrast L 1 can be obtained from the unadjusted dose totals as in the 
case of complete block designs. Take an incomplete block design for K 
doses of standard preparation in block size k' (k' < K). The final design 
is obtained by augmenting each block with k' of test preparation with the 
rule that if s 1 doses of standard preparation is in the block then t 1 of the 
test preparation must be included in the block. Thus we have an 
incomplete block design for bio assays in 2k' units per block from which 
both preparation and combined regression contrasts can be obtained 
from the unadjusted totals. The preparation and combined regression 
contrasts can be obtained from the unadjusted totals. The preparation 
contrast is independent of the block effect because equal number of 
doses of both preparations is occuring in the same block of the design. 
Since the dose totals and T-j have the same coefficient but with the 
opposite sign, in the combined regression contrast, this contrast can be 
expressed as a weighted totals of within block contrast and is therefore, 
free from the block effects. For the same reason the contrasts L 2 +1 and 
lyn + 2 are estimated free from the block effects. The other constants 
^ 2n+i and L 2n+2 are not free from the block effects and therefore, must 
be estimated by the methods appropriate to the design. If is original 
design is BIB design then all the estimated contrasts are uncorrelated. 

Example : An 8-point parallel line assay. 

Take the BIBD with K=4, k' =2 and use s 1 , s 2 , s 3 and s 4 as treatment. 

Blocks 1 s 1 s 2 
2 s 1 s 3 
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4 S 2 S 3 

5 s 2 s 4 

6 s 3 s 4 

The final design in four plot block along with the doses of the test 
preparation is: 

Blocks 1 s 1 s 2 t 1 t 2 

2 s 1 s 3 t 1 t 3 

3 s 1 s 4 t 1 t 4 

4 s 2 S 3 t 2 1 3 

5 ^2 ^4 ^2 ^4 

6 S 3 s 4 131 4 

The coefficient of the dose totals in the different contrasts of the 8 
point assay are 

Table : Coefficient to the dose totals in the different contrasts of 

an 8 point Assays 


Doses Standard preparation Test Preparation 


_ s i ^2 % 

Dose Totals S 1 S 2 S 3 

Con- Lp -1 -1 -1 

trasts L 1 -3 -1 1 

L,' -3 -1 1 

L 2 1 -1 -1 

L 2 ' 1 -1 -1 

L 3 1 -3 3 

L 3 ’ 1 -3 3 



11111 


3 3 1-1-3 

3-3-11 3 

11 - 1-11 
1 - 111-1 
-1 -13-3 1 

-11 -3 3 -1 


Except Lp, other contrast are orthogonal Tables (Fisher and Yates 
(1963)). 

The estimate of regression contrasts from the dose totals is: 

L 1 = -3(S>j-T!) — (S 2 -T 2 ) + (S 3 -T 3 )+ 3 (S 4 - T 4 ). 

Here S s — Tj (i = 1 ,2, 3,4) occur in the same block, Sj - T, is free from 
block effect. Lp which is a function of Sj - Tj is also estimated free from 
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the block effect. From the same reason L 2 ' and L 2 are also free from the 
block effects. Here all contracts lJ 2n and L 2 2n-1 for different values of n 
are estimated free from the block effects. The other contrasts. L^, l 2 
and l_ 3 ' are not free from blocks. To estimate them the individual dose 
effects are first estimated. These estimated dose effects are then 
substituted for the unadjusted dose totals in the affected contrasts. 

Their sum of squares is found by squaring each of them and then 
dividing by the coefficient of a 2 in the variance of the contrasts of the 
dose effects. If the initial design is BIBD the estimate of the affected 
contrasts obtained as indicated above are mutually orthogonal. By adding 
the sums of squares due to all these contrasts including the unaffected 
ones, the adjusted dose sum of squares is obtained. The error sum of 
square can now be obtained by subtraction. 

In case of other incomplete block designs, the estimates of the 
affected contrasts may not he orthogonal. In these error sum of squares 
has to be obtained from the complete analysis of the final incomplete 
designs adopted for the assay. 

Analysis of Parallel Line Assays Based on BIB Designs 

The analysis of parallel line assays is essentially in two parts. The 
first part consists of computation of the analysis of variance and the 
second part consists of estimating the relative potency, its variance, and 
limits. 

The expression for calculating sum of square due to any contrast 
unaffected by block effect is 



Where l-j, l 2 .the coefficient of the contrasts and r is the number 

of replications. The sum of square due to other contrasts are calculated 

from least squares estimates s v s 2 , .s k and t, t 2 ,.t k of the effects 

of various doses of the standard and test preparations (we use the same 
symbols for doses and their effects as there is no confusion). The details 
of analysis for designs based on BIB design is given below. 

The adjusted totals are 
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of=s i -Zv i 

j(i) 

Q/ = T,-Iy, 

j(i) 


Where yj (j - 1,2,.. b) is iUejth block average and summation 

is taken over all the block containing dose i of the preparations. All the 
required contrasts are functions Q's. The unaffected contrasts are 
functions of Q S j - Q t j (= S s - Tj); the sum of squares for this are conveniently 
calculated from unadjusted dose totals. The remaining contrasts are 
function of 

Qi = Q® + Q l 


The s.s. due to any affected contrast 



is given by 



where E = (rk’ - r + A.)/rk\ k’ being the block size of the starting 
design and X the number of blocks in each of which any two given doses 
of standard preparation occur together. 


The variance of the affected contrast is 




2EI?/rE 


V i 


) 


2 

a 2 k, where j s 


the error variance. If a randomized block design is used then the 

/ \ 

.2 


corresponding variance is 


21 If /r 

V i J 


| CT 2k where a|k denotes the error 


variance in complete blocks with 2k units. If one uses a BIB design with 

t \ 


2k doses, block size K 1 then the variance 


21 If /rE 1 

V i 


a 2 k where E is 


the efficiency factor of BIB design and is its error variance. The 
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efficiency of the present design in respedt of affected contrast is E / 

a| k . when compared to randomized blocks design and {E/E) (a k . / j 

when compared with the basic BIB design. 

* f f 

An Example : For illustrating the method of analysis of a 6 point 
symmetrical parallel line assay Bliss (1952. p. 498) presented a body of 
data collected on a vitamin D assay, the design used 12 litter of 6 rats as 
randomized blocks. Here we used these data with some modifications. 
To ensure comparability of the estimate of relative potency all observation 
were used, but were fitted into an incomplete block design of the present 
series by omitting two observations from each of the original block (litters), 
as shown by blanks in table below, and forming 6 additional block (13- 
18) from the 24 observations omitted, ignoring litter differences, but 
retaining the dose-observation relations. 


Table : Data and BIB based on Parallel Line Assay 

(Doses in mg.) 


Block 


Standard 



Test 


Block 

Number 

S 1 

S 2 

S 3 

*1 

*2 

*3 

Total 


2.5 

5 

10 

2.5 

5 

10 


1 

2 

8 

- 

- 

9 

7 

26 

2 

6 

- 

9 

3 

- 

8 

26 

3 

- 

6 

12 

4 

6 

- 

28 

4 

9 

11 

- 

- 

14 

13 

47 

5 

10 

- 

17 

8 

- 

10 

45 

6 

- 

7 

5 

6 

9 

- 

27 

7 

4 

10 

- 

- 

11 

13 

38 

8 

11 

- 

9 

3 

- 

15 

38 

9 

- 

9 

14 

5 

8 

- 

36 

10 

4 

7 

- 

- 

10 

10 

31 

11 

12 

- 

9 

15 

- 

15 

51 

12 

- 

8 

11 

7 

8 

- 

34 

13 

4 

4 

- 

- 

5 

9 

22 
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14 

7 


8 

3 

- 

9 

27 

15 

- 

15 

10 

6 

8 

- 

39 

16 

2 

4 

- 

- 

6 

6 

18 

17 

4 

- 

13 

5 

- 

12 

34 

18 

- 

10 

13 

4 

18 

- 

45 

Totals 

75 

99 

130 

69 

112 

127 

612 


Si 

s 2 

s 3 

Tl 

t 2 

t 3 


Adjusted 

-25.75 

1.25 

22.50 

-31.75 

14.25 

19.50 

0 

Total 

Q? 

Qf 

Qf 

oj 

Q l 

Q l 



S.S. due to block = 358.00 contrast for 6 points parrallel line assay. 

Table : Contrast, Divisor and Sums of Squares for Data of Table 


Value of 
Contrast 

Divisor for SS 

Sum of Squares 

Unaffected contracts 

Lp 4 

6x12 = 72 

0.22 

L, 113 

4 x 12 = 48 

266.02 

L’ 2 -35 

12x12 = 144 

8.51 

Affected Contrasts 

L 1 -3.00 

4 x 9 = 36 

0.25 

l_ 2 -46.50 

12x9 = 108 

20.02 

Total 


295.02 


For this design, r = 12, k’ = 2 and X = 6, whence E = (rk’ - r + X) I k’ 
= 9. The analysis of variance is in the following table. 


As in the original analysis, all the validity tests are satisfied. Though 
the estimate of relative potency must remain identical with the original 
estimate, the mean squares due to error and the affected contrast will be 
different. After regrouping the data there are 49 d.f. for error, of which 10 
may be contaminated by differences between the original litters. The 


mean square for the contaminated component is 6.11 while that for the 
remaining 39 d.f. is 7.23: the error mean square in the original analysis 
was 7.22 with 55 d.f. Thus grouping of observations on animals from 
different litters into the same block has not increased the error mean 
square (as might have been expected from the significance of the between 
block means square). The estimate of the relative potency is 

R = — anti log « ^-.^-1 = anti log 
2.5 [3 L 1 J 

where d = log 10 2 

Table : Analysis of Variance Table 


Nature of variation 

d.f. 

S.S. 

m.s. 

F 

Between Blocks 

17 

358.00 

21.06 

3.01 ** 

Preparations 

1 

0.22 

0.22 


Regression 

1 

266.02 

266.02 

38.00 ** 

Parallelism 

1 

0.25 

0.25 

< 1 

Comb. Quadratic 

1 

20.02 

20.02 

2.86 NS 

Diff. Quadratic 

1 

8.51 

8.51 

1.22 NS 

All Contrasts 

5 

295.02 



Error (by subtraction) 

49 

342.98 

7,00 


Total 

71 

996.00 
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EXPECTED VALUE OF MEAN 
SQUARE IN FIXED, RANDOM 
AND MIXED MODELS 


At the planning stage the experimenter must decide whether the levels 
of factors are to he taken as fixed or are to be chosen at random from 
many possible levels. This decision must be made prior to the running of 
the experiment and if random levels are to be used, they must be chosen 
from all possible levels by a random process. In the case of random 
levels it will be assumed that the levels are chosen from an infinite 
population of possible levels. When all the levels are fixed, the 
mathematical model of the experiment is called a fixed model. When all 
levels are chosen at random, the model is called a random model. When 
several factors are involved, some at fixed levels and others at random 
levels, the model is called a mixed model. 

Single factor experiment 


If the design is completely randomized, the model is x = u + T + p 

\\ " j 

Whether the treatment levels are fixed or random. It is assumed that 
M is a fixed constant and the errors are normally and independently 


distributed with zero mean and same variance e^ ~ N(0, CT 2 ). The 
assumption under fixed and random model will be. 

Fixed model Random Model 


1 • Assumption: 

Tj's are fixed constants 

M j=1 


1. Assumption: Tj’s are random 

variables with mean zero and 
constant variance a* 
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(These add to zero as 

they are the only treatment 
means being considered, 


2. Analysis : Procedure is the 
same as given in earlier chapter 
for computation of SS 

3. EMS (Expected value of mean 
square) 


(Here gj represents the 

variance among or among 

the true treatment means 

The Tj s average to zero when 

averaged over all possible 
levels, but for the K levels of 
the experiment they usually 
will not average to 0) 

2. Analysis : same as for fixed 
model 

3. EMS (Expected value of mean 
square) 


Source 

d.f. EMS 

Source df. EMS 

Treatment 

k - 1 hf 

2 j=1 

e k-1 

Treatment k-1 a 2 + no 2 


Error k(n-1) cr 2 

4. Hypothesis tested 

H 0 : Tj = 0 for all j 


Error k(n-1) g\ 

4. Hypothesis tested 

H 0 : a 2 = 0 


The EMS (expected mean square) column is extremely important in 
deciding how to set upon F test for significance. For the fixed model if 
the hypothesis is true that is Tj = 0 for all j i.e. all the fixed treatment 
means are equal then IT 2 - = 0 and EMS for Tj and error, are both a 2 e. 

Hence, the observed mean squares for treatment and error mean 
square are both estimates of error variance and they can be compared 
by means of a F-test. If this F test shows a significant high values, it 


must mean that 


nIT 2 

k-1 


is not zero and the hypothesis is to be rejected. 


The expected mean square for any term in the model is the long 
range average of the calculated mean square when the Xy from the model 
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is substituted in algebraic form in to the mean square computation. 

For the random model, if the hypothesis is true.a^- = 0 i.e. variance of 
all treatment means is zero, then again each mean square is an estimate 
of error variance. Again an F-test between the two mean square is 
appropriate. 

So it is seen that for a single factor experiment there is no difference 
in the test to be made after the analysis and the only difference is in the 
generality of the conclusions. If Ho is rejected, there is probably a 
difference between the K fixed treatment means for the fixed model; for 
the random model there is a difference between all the treatments of 
which the K examined are but a random sample. The EMS for treatment 
effect is often written as + na^whether the model is fixed or random. 

Two factor model: 

For two factors A and B the model is in case of RBD. = p + Aj + B: 

+ (AB)jj + e k(j j } i =1,2 ...... a, j = 1, 2.b k=1,2 .n. 

Again p is a fixed constant and e k ^ are normally and identically 

distributed with mean O and variance both A and B are at fixed 

levels, the model is a fixed model. If both are at random level the model 
is a random model, and if one is at fixed levels and the other at random 
levels, the model is a mixed model, comparising of each of these models 


gives. 

Fixed 

Random 

Mixed 

1. Assumption 

Assumption 

Assumption 

Aj's are fixed 

Aj's are NID(o, a* ) 

Aj's are fixed 

constants 



> 

n 

o 


> 

n 

o 

Bj's are fixed 

Constants and 

Bj's are NID (o g* ) 

Bj are NID (0, cr|) 

o 

ii 

ccT^ 

■° W.l. 



(AB)y are fixed 

(AB)y are NID (o, cr^ 

(AB)^ are NID 





Constants and 

(o, o^) and 

ZZ(AB)ij=0 

i=ij=i 

0 

11 

m 

< 


Z(AB),*o 

j=1 


(for A fixed and B 

• 

random) 

Analysis Same 

Same 


Procedure is same 
as described earlier 
for sums of square 


EMS EMS EMS 

Source d.f EMS (fixed) EMS (Random) EMS (Mixed) 


IA 


<7o+nb— 1 —- a e + n<J AB + n ka A <7o+naAB+ n b— 


ZA| 


a -1 


a-1 


B; 


Z B j 

k -1 al+na— — 
e b-1 


a e + ncy AB +naoe 


2 2 
<j e + no B 


(AB) a 


n ZSABf 
(a-1 )c 2 e + 1 j 

(b-1) 


(a - 1)(b -1) 


cl + ncl B 


2 2 
"e + n<J AB 


e k(i,j) a «>(n-1) cr 


a, 


4 . 


Hypothesis tested 
H 1 : Aj = 0 for all i 
H 2 : Bj = 0 for all j 
H 3 : ABy = 0 for all i and j 
Summing the interaction 


H 1 : a 2 A = 0 
H 2 : a 2 B = 0 
H 2 : a 2 AB = 0 
term over the fixed 


H 1 : Ai = 0 for all i 

^2• a B = 0 

H 3 : a AB “ 0 
factors (Li)is zero but 
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summing it over the random factor (Ij) is not zero affects the expected 
mean squares. 

For the fixed model, the mean square for A, B and AB are each 
compared to the error mean square to test the respective hypothesis. 
For the random model the third hypothesis of no interaction is tested by 
comparing the mean square for interaction to the mean square for error. 
But the first, second hypothesis are each tested by comparing the mean 
square for main effects (Ai or Bi) with the mean square for the interaction 
as can be seen by their expected mean square values. For the mixed 
model, the interaction hypothesis is tested by comparing the interaction, 
mean square by the error mean square. The random effect (Bj) is also 
tested by comparing its mean square with the error mean square. The 
fixed effect (Ai), are however tested by comparing its mean square with 
the interaction mean square. 

So it can be seen that how important is the EMS column, it tells how 
the tests of hypothesis will be done. So it is important that these EMS 
expression are determined prior to the running of the experiment. In some 
cases the proper test indicated by EMS column will show insufficient 
degree of freedom and where the investigators might like to change the 
experiment. 

EMS rules 

Because of the importance of EMS column in determining what tests 
of significance are to be done after the analysis is completed. In complex 
models this EMS column is again very important and to find the EMS 
value it is very difficult so it is useful to have some simple method of 
determining the EMS values from the models of the given experiment 
(Hicks, 1964). Aset of rules will determines the EMS column very rapidly 
without recourse to their derivation. To determine the EMS column for 
any model: the rules will be illustrated for the two factor mixed model as 
given in Hicks(1964). 


1. Write the variable terms in the model as row heading in a two 
way table 



e k(ij> 
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subscript write F if the factor levels are fixed, R if random; also 
write the number of observation each subscript is to cover. 

a 

b 

n 

F 

R 

R 

i 

j 

k 

a 



B i 

(AB), 



e k(ij) 



3. For each row (each term in the model) copy the number of 
observations under each subscript, providing the subscript does 
not appear in the row heading. 

a 

b 

n 

F 

R 

R 

i 

j 

k 

a 

b 

n 

B i a 


n 


■—^ ■ ■ 
(AB)jj n 

e k(ij) 


4. For any bracketed subscript in the model place al under those 
subscripts which are inside the brackets. 



a 

b 

n 


F 

R 

R 


i 

j 

k 

Ai 


b 

n 

Bj 

a 


n 

(AB), 



n 

e k(ij) 

1 

1 



5. Fill the remaining cells with 0 or 1, depending upon whether the 
subscript represents a fixed (F) or a random (R) factors. 
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a 


b 


n 


F 


R 


R 


i 


j 


k 

Ai 

0 

b 


n 


Bj 

a 

1 


n 


(AB)jj 

0 

1 


n 


e k(ij) 

1 

1 


1 



6. To find the expected mean square for any term in the model. 
Cover the entries in the column (or columns) which contain non 
bracketed subscripted letters in this term in the model ( e.g. for 
Ai, cover column i; for e k ^ cover column k) 

Multiply the remaining numbers in each row. Each of these products 
is the coefficient to its corresponding term in the model provided the 
subscript on the term is also a subscript on the term whose expected 
mean square is being determined. The sum of these coefficient multiplied 
by the variance of their corresponding terms is the expected mean square 
of the term being considered (e.g., forAi, cover column i). The products 
of the remaining coefficients are bn, n, n and I, but the term n is not 
used, as there is no i in its term (Bj). The resulting expected mean 
square is then (bn a 2 A + na 2 AB + 1 ,<j 2 e). For all terms these rules give: 



a' 

F 

i 

b 

R 

j 

n 

R 

k 

EMS 

A 

0 

b 

n 

a e + ncy AB +nb(4 


a 

1 

n 

2 2 
a| +naag 

(AS), 

0 

1 

n 

a e + na AB 

e k@ 

1 

1 

1 

cl 


Za; 


Here is a fixed type of variance <j 2 = J=! 


a-1 


The application of these rules becomes very easy to use after a bit of 
practice. 
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Example: 

Consider a6x4x5x2 factorial experiment. Here the first factor Oi 
has random levels (6 being chosen). Factor A has 4 fixed levels, C k has 
5 fixed levels and (L M ) has two fixed levels. There were six replications 
and the design used is CRD. The expected mean square can be 
determined from the rules given in earlier section. 

EMS values 



6 

4 

5 

2 

6 



R 

F 

F 

F 

R 


Model 

i 

j 

k 

m 

q 


Oi 

1 

4 

5 

2 

6 

e\ + 240 ctq 


6 

0 

5 

2 

6 

CTe + 60 <?0A + 360 CTA 

OAj 

1 

0 

5 

2 

6 

el + 60 <r§ A 

Ok 

6 

4 

0 

2 

6 

<yj + 48 cTg C + 288 gq 

o 

o 

1 

4 

0 

2 

6 

el + 48 <j^ c 

AC jk 

6 

0 

0 

2 

6 

el + 12 <4ac + 72o ac 

OACjj k 

1 

0 

0 

2 

6 

el +12 °oac 

L 

6 

4 

5 

0 

6 

CTg + 120 (Jol+ 720 gl 

DL im 

1 

4 

5 

0 

6 

Ge + 120 gq L 

A *-jm 

6 

0 

5 

0 

6 

el + 30 cr„ AL + 180 o AL 

OAL, m 

1 

0 

5 

0 

6 

el + 30 v 2 m_ 

CLkn, 

6 

4 

0 

0 

6 

e\ + 24 Oqcl + 144 <jq l 

OOLjot 

1 

4 

0 

0 

6 

+ 24 ct ocl 

AOLjkm 

6 

0 

0 

0 

6 

el + 6 CTo ACL + 36 a ACL 

OACL|j km 

1 

0 

0 

0 

6 

+ 6 °oacl 

Eq (i, j, k, m) 

1 

1 

1 

1 

1 

2 

a e 
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Here it is easily seen that all interaction involving factor O and main 
effect O are tested against the error mean square at the bottom of the 
table. All interaction and the main effects involving fixed factors are tested 
by the mean square just below them in the table. However algebraic 
derivations of EMS is also shown as below. 

EMS Derivations : 

Single factor experiment. The model is 

x i j = H + T j + e ij (i) 

Here the sum of squares for the treatment effect is given as: 

SST = Zn(x.j-x ..) 2 
j=1 

Here the number of observation per cell per treatment is equal to n. 


n n (|i + T: +eii) 

x.j = Zxy /n = —--li 

m i=i n 

_. np nTj n 

x.j = — + — + Ze ii /n 


n n j=i 


_ n 

x i = ^ + Tj +S e ij /n 

i=1 


Also 


k n 


k n 


x -=SI x ii /nk = Sl(H + T i +e ij 

j=1i=1 j i 



X. 


nkp 

nk 


k ^ 


f k k 'N 

n^Tj / nk 

+ 

IXV nk 

k J =1 ; 


U-1i=1 J 



H + XTj/k + ££eij/nk 

j=1 j=1i=1 


\ 

J 



fk ) 


( n A 


f n k A 

Xl 

1 

XI 

’ll 

1 

1 

ZTj/k 

\i =1 J 

+ 

Z e ij/n 

V=1 J 

— 

IZ e ij/nk 

V =1 j=1 ) 


we have 
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Expected 

squaring gives 


I- 

f k Y 

n2 

1 

+ ~2 

n n k 1 

V 

Z^/k 

Z e ij _ Z Z e ij / k 

U =1 /. 

n 

_i=1 ' j 


(xj-x..) 2 = 


+ crossproduct term. 

Multiplying by n and summing over j gives. 


SST = £n(xj-x.) 2 =n£ 

j=1 j=1 


Tj-ZTj/k 

H 


n 


n' 


I 

j 


n 

Z e ij 

i=1 


^ n k 

ZZV k 

1 J J 


-\2 


+ n£ (crossproduct) 


E(SST)= nE 


n 


f 




Tj ZTj/k 


j=\ H 


1 

+ — 

n 



r 

k 

n 

^ n n ^ 


2' 

E< 

I 

Z e ij “ 

IZe,/k 


> 


lj=1 

L 

i=1 

o-i H J 


J 


as it can be shown that expacted values of crossproduct term is 
equal to zero. 

It the treatment levels are fixed. 


E T j =° 
j=i 

and E(SST) become 

E(SST) = n£Tj 2 + —(nk-n)<Tg 
H n 


Error component are random and Z Tj 2 is a constant. The E(SST/ 

j=i 


k-1) so 


E(MST) = 


/ k o ' 

n Z T| 2 /k-1 


^ H 


y 


n(k -1) _ 2 
n(k -1) * 
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If however the treatment levels as random. 

k 


SVo 

i=i 



= (n - Xpl 


E(MST) = - (k 1)o t + 2 
(k-1) e 

For the error mean squre 
SSE = l£( Xij -*j) 2 

j=1i=1 

we have 

x ij-x-j = e ij -j;e ij /n 

i=1 

Squaring and adding 



Taking the expected value, we have. 
E(SSE) =ESEk-(se ij /n)| 2 

j=1i=1 J 

= ZEE[e l i-(2e ij /n)] 2 

j=1 i J 

= S( n -1)«Je =k(n-1)o| 

i-i 


E(MSE) = 



SSE 'j 
K(n-1), 


= a 


2 

e 


So the expected values are coming out to the same as we have 
shown in table using simple rules. So there is a advantage of simple 
rules in determining the EMS values. 


□□□ 
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